(57) Abstract

A content addressable memory ("CAM") engine (100) or controller interfaces between a host signal processor (e.g., a microprocessor)
and a plurality of known, commercially-available random access memory ("RAM") devices. The CAM engine (100) configures the RAM
as content addressable memory, thereby causing the normally location-addressed RAM to function as CAM. The CAM engine (100) thus
allows for the benefits of both RAM and CAM devices, such as speed, density, cost and intuitiveness, without their inherent drawbacks.
Further, the CAM engine (100) implements various flexible memory storage configurations for the keys and associations stored in RAM.
Also, the CAM engine (100) implements certain algorithms that provide for the hashing of data, for table load and unload capabilities, for
proximity matching, for dealing with overflow conditions, and for implementing hierarchical search capabilities.
Description

CONTENT ADDRESSABLE MEMORY (CAM) ENGINE 
Technical Field 

This invention relates generally to computer memory storage devices and 
corresponding controllers, and more particularly to a controller for associative
memory or content addressable memory devices in which the contents of
well-known random access memory ("RAM") devices are retrieved not by use of
location addresses, but by use of "keys" associated with the contents of
corresponding memory locations. 

A computer typically comprises a signal processor, which operates
according to defined instructions. The computer also includes a memory storage
device that stores the instructions for the signal processor along with data utilized 
by the processor for various purposes. The most common type of memory device 
is RAM, which stores data at particular locations normally defined by specific
addresses. To either store or retrieve data in RAM, the signal processor must
supply the specific address of the desired memory location to a memory address 
register associated with RAM. All memory locations within RAM are truly 
randomly-accessible in that the processor can access any location independently of 
all other locations in a specific period of time that is constant for all memory
addresses.

A RAM device typically comprises a matrix of memory locations arranged
by rows and columns. As such, these locations are indexed by row and column
numbers. This arrangement of memory locations allows the RAM device to be of
relatively high density and low cost. Modern RAM devices are also relatively fast
in implementing data storage and retrieval operations. These factors contribute
greatly to the high popularity of RAM devices.

With RAM, the number of discrete, addressable locations is essentially
limited by the number of address lines provided with the memory device. For
example, sixteen individual address lines allow for 2^16, or 65536, separate and
distinct memory locations to be addressed within a RAM integrated circuit.
Modern commercially-available RAM devices may have one million (i.e.,
"1 Meg") or more individually-addressable memory locations.
On the other hand, generally the bit size or width of RAM locations varies
greatly between commercially-available devices. The width of each memory
location is typically determined, in part, by the width (i.e., number of bits) of the
data bus employed by the computer system. For example, for a 16-bit wide data
bus, the width of a RAM memory location may be 16 bits or may be 64 bits (i.e.,
four times the 16-bit word size). In the alternative, the word size may be only one
bit wide. These sizes are purely exemplary. The word size depends primarily on
the chosen application for the RAM device.

RAM devices are generally divided into two categories according to their
electrical operating characteristics: static and dynamic. A static RAM ("SRAM")
device is constructed internally such that once data have been written into the
locations, the logic states of the data are maintained as long as electrical power is
applied to the SRAM device. On the other hand, a dynamic RAM ("DRAM")
device requires constant refreshing of its internal circuits to maintain the logic
states of the stored data in the memory cells over time.

Despite their popularity, RAM devices have inherent limitations. The
primary limitation involves accessing stored data using a location-address method.
For example, if it is desired to access one specific piece of data, then, for fastest
data retrieval, the corresponding RAM location must be known. This requirement
stems from the fact that the RAM location address has no logical relationship to
the stored data. In RAM, the address is merely an artificial construct. If the
specific RAM address is unknown (which is often the case), the processor must
employ some type of search process to locate the desired piece of data. With
RAM, such searching is typically carried out sequentially, one location at a time.

This sequential searching drawback is magnified in certain software
applications that store a large number of items in a data structure such as a table.
Many modern software applications are table driven. Software developers have
embraced this approach because the resulting software is flexible, understandable
and readily maintainable. Generally, these applications consume considerable
processor resources. For example, simulation applications are generally table
intensive, with a reputation for consuming large amounts of processor time.
Unfortunately, this means that the signal processor is spending a great deal of time
just finding data entries in the tables.

More specifically, when it is desired to search through, correlate and/or sort
a table of data, the signal processor must sequentially scan large blocks of RAM
locations to find and/or position certain data during these operations. This serial
data processing function is required since a RAM device generally has no
capability for scanning, correlating and/or sorting its entire contents in parallel.
For example, in a data table implemented in RAM and containing a list of
records (e.g., a list of people), with each record having several data fields (e.g.,
names, addresses, phone numbers), the signal processor must specify the exact
RAM address to find the desired data in the table. Alternatively, the processor
could run software that sequentially searches the entire table for a desired field.
As compared to a location address, the concept of a "field" has somewhat more of
a logical relationship to the stored data. Yet, because RAM generally lacks a
parallel scanning ability built into the hardware logic on the RAM device, the
signal processor must sequentially search the entire table to locate the desired
data. Sequential searching is extremely time-consuming when utilizing RAM
devices to store databases. In some simple data capture applications, the tedious
implementation of software lookup algorithms represents a significant portion of
the application's complexity. The inherent RAM "bottleneck" problem often
means that the speed of the RAM device, in storing and accessing data, is the
limiting speed factor in the overall computer system. This is becoming more
evident as recent speed improvements in signal processors have advanced past
speed improvements for memory devices.

As a way of solving the inherent performance penalty associated with the 
sequential processing of data tables stored in RAM devices, it has long been 
desired to make memory devices more intuitive in terms of storing and accessing 
data. That is, it is desired to provide a memory device that functions in more of an
associative manner. This is akin to human memory, where stored abstract
information is referenced not by addresses, but by some other logically-related
abstract information. Ideally, the associative memory should also compare to 
RAM in terms of speed, cost and density. 

Various solutions have been proposed to make computer memory devices
more associative in nature while retaining speed, cost and density advantages.
These solutions center around both software and hardware techniques. Software
schemes for making RAM more associative typically involve such techniques as
hashing algorithms, software data structures, databases and neural networks.
While these techniques have had some success, they all have a speed cost
associated with them, because each associative reference requires many RAM
access cycles and, correspondingly, many signal processor cycles. Nevertheless,
when utilizing these software techniques, signal processor and memory speed
improvements have generally kept pace with application speed requirements.
For example, "hashing" generally refers to software algorithms basically
used to store and retrieve data from memory devices. Generally, hashing
algorithms randomly scatter data throughout the available memory space using
various mathematical functions, such as simple multiply or divide operations.
Essentially, hashing is the opposite of the orderly sorting of data in sequential
memory locations. The same mathematical function is used to retrieve the stored
data. Data stored via hashing can usually be found more quickly than data stored in a
sorted, orderly manner.
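
The store-and-retrieve symmetry of software hashing can be illustrated with a minimal Python sketch. The table size, the divide-style (remainder) hash function, and the linear-probing collision rule are assumptions chosen for illustration; the passage above does not specify them.

```python
# Minimal sketch of software hashing: the same function that scatters a
# key on store is reused to find it on retrieve. TABLE_SIZE, hash_slot and
# the linear-probing collision rule are assumptions for illustration.
TABLE_SIZE = 8
table = [None] * TABLE_SIZE  # each slot holds (key, value) or None


def hash_slot(key: int) -> int:
    """Simple divide-style hash: remainder of the key by the table size."""
    return key % TABLE_SIZE


def store(key: int, value: str) -> None:
    slot = hash_slot(key)
    for _ in range(TABLE_SIZE):
        if table[slot] is None or table[slot][0] == key:
            table[slot] = (key, value)
            return
        slot = (slot + 1) % TABLE_SIZE  # collision: try the next slot
    raise MemoryError("table full")


def retrieve(key: int):
    slot = hash_slot(key)
    for _ in range(TABLE_SIZE):
        entry = table[slot]
        if entry is None:
            return None  # an empty slot ends the probe: key is absent
        if entry[0] == key:
            return entry[1]
        slot = (slot + 1) % TABLE_SIZE
    return None


store(42, "alpha")
store(50, "beta")  # 50 % 8 == 2, collides with 42 and moves to slot 3
```

Each lookup in such a scheme still costs one or more RAM accesses plus processor cycles, which is the speed cost noted above for software techniques.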

In contrast to these various software schemes that attempt to solve the
inherent sequential accessing problem of RAM devices, content addressable
memory ("CAM") devices are known. Instead of storing data via addressed
locations, a CAM comprises a plurality of memory locations accessed by the
signal processor using a construct based on the contents of those locations. More
specifically, instead of using an address to access a particular memory location, the
CAM uses a "key" which contains a portion of the desired contents of a particular
memory cell that the processor is looking for. The key itself is also stored in the
allocated CAM memory space. Once the desired key has been applied by the
processor to the CAM in a data read operation, the CAM will simultaneously
examine all of its entries and select the stored data (i.e., the "association") that
matches the key. Thus, a CAM contains built-in hardware logic (e.g., a
comparator) that performs a parallel search of stored CAM data.
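
The key-to-association read just described can be modeled behaviorally in a few lines of Python. This is only a sketch: the entry layout and all names are hypothetical, and software necessarily evaluates the comparisons one after another, whereas CAM hardware evaluates them simultaneously.

```python
# Behavioral model of a CAM read. Real CAM hardware compares every entry
# at once; this code only imitates the result, not the parallelism.
# The (key, association) entry layout and names are illustrative assumptions.
from typing import Optional

cam_entries = [
    ("smith", "555-0101"),
    ("jones", "555-0102"),
    ("brown", "555-0103"),
]


def cam_read(key: str) -> Optional[str]:
    """Return the association whose stored key equals the applied key."""
    # One "match line" per entry, as a comparator per entry would produce.
    match_lines = [stored_key == key for stored_key, _ in cam_entries]
    for matched, (_, association) in zip(match_lines, cam_entries):
        if matched:
            return association
    return None  # no-match condition
```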

Thus, a CAM is essentially an associative memory that operates more
intuitively than RAM, and somewhat similarly to human memory. An associative
memory is generally one that allows its stored information to be retrieved based
on a partial knowledge of that information. Since the CAM simultaneously scans
all of its locations in parallel, a CAM is useful for applications that require the
extremely fast location or placement of data. Some exemplary CAM applications
include artificial intelligence, pattern recognition, image processing, robotics
control, communications networking (e.g., high-speed routers and switches), and
arithmetic operations. Essentially, CAM devices find application in any system
involving fast look-ups of large tables. CAM devices greatly speed up any
application requiring search-intensive and pattern-matching functions. Since a
CAM reduces data access time by identifying data by content versus address, any
database searching, correlating or sorting operation is made faster by use of such
CAM devices.

However, a CAM is not without its inherent drawbacks, despite the fact that
it is extremely intuitive and fast. As compared to RAMs, the drawbacks generally
involve relatively poor densities and high cost. These particular drawbacks stem
from the extra hardware, provided on each CAM integrated circuit, required to
perform the parallel search. CAM devices use comparators to find stored data.
These comparators typically perform comparison operations on selected bits
within the data words to match the provided key with the corresponding
association. Because of the complexity of this extra hardware, a CAM integrated
circuit is not able to store data at as high a density as a RAM integrated circuit.
This means that a smaller number of memory cells can be implemented on a CAM
integrated circuit, thereby requiring a larger number of CAM integrated circuits
(and a correspondingly larger printed circuit board area) to implement the same
size computer memory scheme as with RAM devices.

Other problems with known CAM devices include the fact that, since a
CAM includes a large amount of extra complex hardware to implement parallel
scanning, a CAM generally does not allow for a plurality of tables of different key
and association widths and different record capacities. Therefore, if an application
desires more than one table with each table having different key and association
widths and record capacities, then a separate CAM device is required for each
table. Generally, this is not practical from a cost and hardware component
standpoint.

Despite these drawbacks, CAM devices still have usage in certain
applications, particularly telecommunications. This is because of the inherent
speed advantage of CAM, as compared to RAM, when the software application 
calls for a speed-critical associative look-up of data. 

Therefore, what is desired is a hardware approach for implementing a
content addressable memory scheme that utilizes the benefits of current RAM and
CAM devices, while eliminating the drawbacks of each type of device.

Accordingly, it is a primary object of the present invention to leverage or 
utilize the inherent speed and intuitiveness of associative memory techniques with
the cost and density advantages of random access memories to implement a
content addressable memory scheme.
It is a general object of the present invention to implement a CAM
controller device or "engine" that transforms conventional RAM devices into
CAM devices at the hardware level.

It is another object of the present invention to interface relatively large 
capacity, low cost RAM devices with a host signal processor through use of a
CAM engine. 

Still another object of the present invention is to allow for the 
implementation of data tables, within the interfaced RAM devices, having 
programmable key widths and programmable association widths for each table. 

Yet another object of the present invention is to allow for the 
implementation of more than one table stored within any one RAM device 
interfaced to the CAM engine, wherein the multiple tables can have differing key 
and association widths and differing record capacities. 

It is another object of the present invention to provide for relatively rapid 
(under 100 nanoseconds typical) matching of the provided key to the 
corresponding stored association. 

Another object of the present invention is to provide the CAM engine with 
bulk table load and unload capabilities, thereby allowing the host signal processor 
to quickly move a table between disk storage and the interfaced RAM devices. 

Another object of the present invention is to allow for rapid direct memory 
access by the host signal processor of the interfaced RAM devices. 

Yet another object of the present invention is to provide for incremental add 
and delete record capabilities with respect to the data tables stored in the 
interfaced RAM devices. 

It is another object of the present invention to allow for the partitioning of 
memory devices into multiple tables of various sizes, thereby allowing for the 
flexible configuration of a relatively large amount of RAM devices into useful 
segments or records. 

Yet another object of the present invention is to provide for hierarchical 
search capabilities within a plurality of data tables stored within the interfaced 
RAM devices. 

Still another object of the present invention is to provide for proximity 
match capabilities to locate the closest data associated with the key presented to 
the interfaced RAM devices. 
Another object of the present invention is to provide a memory structure
having a pipelined architecture that provides for interaction with the host signal 
processor in parallel with memory access functions such as adds, seeks and 
deletes. 

Yet another object of the present invention is to provide a CAM engine
that off-loads a large amount of duties from the host signal processor in
managing a large bank of RAM devices configured as content addressable 
memory. 

Still another object of the present invention is to eliminate the need for 
custom hardware or software solutions in implementing a content addressable
memory. 

Yet another object of the present invention is to improve the performance of 
software applications involving such intensive table-driven data manipulation 
activities as data storing, correlating and/or sorting. 

The above and other objects and advantages of the present invention will 
become more readily apparent when the following description is read in 
conjunction with the accompanying drawings. 

Disclosure of Invention 

To overcome the deficiencies of the prior art and to achieve the objects
listed above, the Applicant has invented a CAM "engine" or controller that
interfaces between a known, commercially-available, host signal processor and
known, commercially-available RAM devices. 

In its broadest aspect, the CAM engine or controller of the present invention
is a single-chip integrated circuit that interfaces between the processor and a
plurality of RAM devices. The CAM engine essentially transforms the interfaced
RAM into CAM in terms of data storage and access methodology and
functionality. The CAM engine allows the stored RAM data to be rapidly
accessed by the interfaced signal processor (e.g., less than 100 nanoseconds)
through use of a descriptor (i.e., a "key") that is related to the stored data (i.e.,
the "association").

The CAM engine also implements certain data storage and retrieval features 
within the interfaced RAM devices. In accordance with a specific, additional 
aspect of the present invention, the CAM engine allows multiple database tables of 
different key and association widths and different record capacities to be
configured in a single RAM device. 

The CAM engine also implements the related concepts of hierarchical tables
and table overflow conditions within the interfaced RAM devices. Specifically,
RAM tables can have a parent/child hierarchy of theoretically unlimited depth.
When a key is presented to a parent table and no corresponding association is
found, the key is then presented to a child table: the key is masked to the length of
the child table's key (i.e., the most significant bytes), and a new search for the
association is initiated. That search does not stop until a match between the key
and association is found, or a no-match condition occurs within a table that does
not have a subsequent child table. Thus, tables with different key lengths can be
linked together hierarchically and searched in sequence for the most significant
bytes of the key.
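
The parent/child search described above can be sketched in Python as follows. The `Table` class, the use of a dictionary as a stand-in for a table held in RAM, and the sample keys are all assumptions made for illustration; only the masking of the key to its most significant bytes and the descent to the child on a miss come from the description.

```python
# Sketch of a hierarchical seek: on a miss, the key is masked to the child
# table's key length (its most significant bytes) and the search repeats.
# Table, its fields and the sample data are hypothetical.
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class Table:
    key_len: int                        # key width in bytes
    records: Dict[bytes, str] = field(default_factory=dict)
    child: Optional["Table"] = None     # next table to try on a miss


def hierarchical_seek(table: Optional[Table], key: bytes) -> Optional[str]:
    while table is not None:
        masked = key[: table.key_len]   # keep the most significant bytes
        if masked in table.records:
            return table.records[masked]
        table = table.child             # no match: descend to the child
    return None                         # no match and no further child


# Example: a 4-byte table backed by 2-byte and 1-byte fallback tables.
leaf = Table(key_len=1, records={b"\x0a": "default route"})
mid = Table(key_len=2, records={b"\x0a\x01": "site route"}, child=leaf)
root = Table(key_len=4, records={b"\x0a\x01\x02\x03": "host route"}, child=mid)
```

With these sample tables, a full four-byte key that misses in `root` falls back to progressively shorter prefixes, which resembles a longest-prefix lookup of the kind used in network routing.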

A second use of this hierarchy feature is for handling table overflows.
Specifically, when a table becomes full, the host processor can configure a new
table with exactly the same key and association structure and establish it as a child
table (i.e., a table subservient to the parent table). The host processor can begin
adding records to the newly-established child table, and these records will be
located when searching the parent table. This dynamic table configuration feature
allows the CAM engine to handle table overflows transparently.
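
A minimal Python sketch of this overflow handling follows. It models the host-side step of configuring an identically structured child table once a table is full; the `Table` class, the fixed `capacity` field and the function names are hypothetical, and duplicate keys are not handled.

```python
# Sketch of overflow handling via child tables. The capacity limit, class
# and function names are assumptions; duplicate keys are not handled.
from typing import Dict, Optional


class Table:
    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        self.records: Dict[str, str] = {}
        self.child: Optional["Table"] = None


def add_record(parent: Table, key: str, association: str) -> None:
    table = parent
    while len(table.records) >= table.capacity:
        if table.child is None:
            # Host-side step: configure a table with the same structure
            # and establish it as the child of the full table.
            table.child = Table(table.capacity)
        table = table.child
    table.records[key] = association


def seek(parent: Table, key: str) -> Optional[str]:
    table = parent
    while table is not None:            # a miss falls through to the child
        if key in table.records:
            return table.records[key]
        table = table.child
    return None


parent = Table(capacity=2)
for k, v in [("a", "1"), ("b", "2"), ("c", "3")]:
    add_record(parent, k, v)
```

A seek addressed to the parent still finds record "c", even though it physically resides in the child table.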

Another aspect of the CAM engine of the present invention relates to the
ability of the CAM engine to establish relative and linked associations within the
interfaced RAM devices. Relative associations allow for a first bank of RAM
devices to be configured to store the "keys", while the "associations" are stored,
in the same relative order, in a second bank of RAM devices. The CAM engine
keeps the second bank of RAM "primed" (i.e., the CAM engine keeps the rows
pre-charged) to allow the association to be read only 10 nanoseconds (for a 100
MHz implementation) after the key is found. This feature greatly speeds up the
search on the key by eliminating the need to read through irrelevant association
data.
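
The relative-association layout can be sketched in Python with two parallel lists standing in for the two RAM banks. The list contents and names are hypothetical, and the linear scan of the key bank is only a stand-in for whatever search the engine performs; the point illustrated is that the association bank is read once, at the index where the key was found.

```python
# Sketch of relative associations: keys live in one bank, associations in
# a second bank at the same relative position. Names and data are assumed.
from typing import List, Optional

key_bank: List[Optional[str]] = ["smith", "jones", None, "brown"]
assoc_bank: List[Optional[str]] = ["555-0101", "555-0102", None, "555-0103"]


def seek_relative(key: str) -> Optional[str]:
    """Search only the key bank; read the association bank once, on a hit."""
    for index, stored_key in enumerate(key_bank):
        if stored_key == key:
            return assoc_bank[index]  # same relative position in bank two
    return None
```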

A further aspect of the CAM engine of the present invention involves a hash 
algorithm which reduces the time for an original hash and allows for a rehash of 
every key read. This checks whether the CAM engine has read beyond the end of 
the possible words in RAM where the key could be stored. The benefit of this 
feature is that it allows the CAM engine to terminate with a no-match condition as
quickly as possible without having to store the hash of each key or to use pointers. 

Yet another aspect of the CAM engine of the present invention relates to a 
proximity match feature that functions to return the stored association data that 
most closely matches the applied key. This proximity match feature is invoked
when there exists no exact identity between stored associations and the applied 
key. 
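
As an illustration of the proximity match idea only, the following Python sketch returns the association of the nearest stored key when no exact match exists. The measure of closeness is not defined in this passage; numeric keys compared by absolute difference are an assumption, as are the names and sample data.

```python
# Sketch of a proximity match. The closeness metric (absolute difference
# between numeric keys) is an assumption; the source does not define it.
from typing import Dict, Optional

records: Dict[int, str] = {100: "low band", 200: "mid band", 400: "high band"}


def proximity_seek(key: int) -> Optional[str]:
    if key in records:                  # exact match takes priority
        return records[key]
    if not records:
        return None
    nearest = min(records, key=lambda stored: abs(stored - key))
    return records[nearest]
```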

Still another aspect of the CAM engine of the present invention involves the
use of hardware FIFOs (first-in, first-out buffers or registers), that allow the CAM
engine to implement a pipelined architecture. The FIFOs allow the CAM engine
to essentially perform a co-processor role, freeing up the host signal processor for
tasks other than memory management. The FIFOs operate asynchronously with
the remainder of the CAM engine integrated circuit to allow the host signal
processor to communicate with the CAM engine at a slower rate, while the CAM
engine communicates with the interfaced RAM at a higher rate.
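
The decoupling provided by the FIFOs can be sketched in Python with two queues. The command names follow the adds, seeks and deletes mentioned earlier; the tuple layout, the status strings and the dictionary standing in for the interfaced RAM are assumptions for illustration.

```python
# Sketch of FIFO-decoupled operation: the host queues commands at its own
# pace and the engine drains them in order. Tuple layout, status strings
# and the dict standing in for RAM are hypothetical.
from collections import deque

input_fifo: deque = deque()    # host -> engine: (command, key, data)
output_fifo: deque = deque()   # engine -> host: (status, data)
ram_table: dict = {}           # stand-in for the interfaced RAM


def host_write(command: str, key: str, data=None) -> None:
    input_fifo.append((command, key, data))


def engine_step() -> None:
    """Execute the oldest queued command, if any, and queue its result."""
    if not input_fifo:
        return
    command, key, data = input_fifo.popleft()
    if command == "add":
        ram_table[key] = data
        output_fifo.append(("ok", None))
    elif command == "seek":
        found = key in ram_table
        output_fifo.append(("ok" if found else "no-match", ram_table.get(key)))
    elif command == "delete":
        removed = ram_table.pop(key, None) is not None
        output_fifo.append(("ok" if removed else "no-match", None))


# The host queues three commands without waiting for any of them.
host_write("add", "jones", "555-0102")
host_write("seek", "jones")
host_write("seek", "smith")
while input_fifo:
    engine_step()
results = list(output_fifo)
```

Because results accumulate in the output queue in command order, the host can collect them later rather than stalling on each operation.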

Finally, the CAM engine of the present invention implements ancillary 
features, such as table load and unload, which allow data tables to be quickly 
moved between the interfaced RAM and external storage devices, such as hard 
disk drives, where the data can be manipulated by various software applications or
utilities.



Brief Description of Drawings 

FIG. 1 is a block diagram of a preferred exemplary embodiment of the CAM
engine of the present invention;

FIG. 2 is a block diagram illustration of the CAM engine of FIG. 1
connected to a host signal processor and a plurality of RAM devices;

FIG. 3 illustrates the concept of linked associations among keys and
associations stored in the RAM devices of FIG. 2;

FIG. 4 illustrates the concept of relative associations among keys and
associations stored in the RAM devices of FIG. 2;

FIG. 5 illustrates the concept of the hierarchical table implementation and
search function implemented by the CAM engine of FIG. 1;

FIG. 6 illustrates the concept of table overflow implemented by the CAM
engine of FIG. 1;

FIG. 7 illustrates a table depicting information utilized by a hash function
implemented by the CAM engine of FIG. 1;

FIG. 8 is a graphical illustration of a hardware implementation of a hash
algorithm implemented by the CAM engine of FIG. 1;

FIG. 9 is a more detailed representation of the algorithm of FIG. 8; and

FIG. 10 illustrates an example of a record addition process implemented by
the CAM engine of FIG. 1.

Best Mode for Carrying Out the Invention 

Referring to FIG. 1, there illustrated is a functional block diagram of a
preferred exemplary embodiment of the CAM engine 100 of the present invention.
As the term is used herein, "CAM engine" essentially refers to a single-chip
integrated circuit 100 that functions as a memory controller device. The CAM
engine 100 provides for direct memory access of RAM via keys instead of
physical addresses. Also illustrated in FIG. 1 are the signal lines that connect the
CAM engine 100 with various external components, discussed hereinafter. These
signal lines are arranged in three groups: "CAM Port", "System" and "RAM
Port".

FIG. 2 illustrates an exemplary embodiment of the CAM engine 100 of FIG.
1 connected to both a host signal processor 104 (e.g., a commercially-available
microprocessor) and a plurality of commercially-available RAM devices 108
("RAM #1 - RAM #n") via the signal lines of FIG. 1. The RAM 108 may
comprise static RAM ("SRAM") devices, synchronous dynamic RAM
("SDRAM") devices, or other types of RAM or other memory devices whose
storage locations are normally not selected using a construct that is somewhat
logically related to the stored data.

The CAM engine also connects with assorted system components 112, such
as an oscillator clock circuit. The CAM engine 100 may be directly mapped into
the address space of the host signal processor 104, or a separate
address decoder (not shown) may be utilized to derive the chip select signal,
"CS/" in FIG. 1, for the CAM engine from the host's address bus.

In operation, the CAM engine 100 transforms the RAM 108 into CAM, at
the hardware level, in the manner in which data are both stored and retrieved in
RAM. The RAM 108 stores both keys and associations in tables and returns
associations based on keys presented by the processor 104 to the CAM engine
100. This associative manner of identifying stored data for retrieval is in contrast
to the usual way RAM 108 is accessed using physical addresses of the RAM
storage locations.

Although the contents of the keys could represent some portion of the
contents of the corresponding associations, the keys, if desired, may contain no
part of the contents of the corresponding associations. Even so, the keys are still
related to the corresponding associations. This is somewhat different from
conventional CAMs which have the keys represent a portion of the corresponding
associations. Nevertheless, as compared to location addresses, the keys are more
logically related to the associations, whether the keys represent a portion of the
associations or not.

Even though the CAM engine 100 is disclosed herein as being a separate
component from the one or more interfaced RAM devices 108, the present
invention contemplates that both the CAM engine circuitry and the RAM cells
could conceivably occupy the same integrated circuit component. Similarly, the
host processor 104 is disclosed herein as being separate from the CAM engine
100. However, some amount of signal processing circuitry may be implemented
on the same integrated circuit as the CAM engine 100, with or without RAM cells
also being on that integrated circuit.

The CAM engine 100 is fabricated using known integrated circuit
techniques. In a preferred exemplary embodiment, the CAM engine integrated
circuit 100 is based on 0.35 micron technology, and operates at 100 MHz. With
the CAM engine operating at 100 MHz, host signal processors 104 can operate at
speeds up to 100 MHz. The 100 MHz clock signal may originate from the
oscillator circuit 112 connected to the CAM engine as the "CLK" signal of FIG. 1.
At that clock speed, the CAM engine can provide for association matches (i.e.,
matches between selected stored keys and corresponding stored associations) in as
little as 45 nanoseconds when the CAM engine is connected with fast SRAM
memory devices. On the other hand, when used with SDRAM and running at 100
MHz, the association match is typically 85 nanoseconds. These speeds rival those
of conventional CAM devices.

The CAM engine 100 interfaces with the host signal processor 104 through
a configurable, high-speed, 64-bit, bi-directional data bus 116 ("D0-D63"). The
data bus 116 facilitates data exchange between the host and the CAM engine. The
CAM engine's four-bit command bus 120 ("AC0-AC3") may be connected directly
to the lower four bits of the host signal processor's address bus. The command
bus 120 communicates fourteen exemplary commands, described hereinafter, from
the host 104 to the CAM engine 100. These commands direct the operation of
the internal signal processing circuitry of the CAM engine in controlling the
interfaced RAM devices 108. The data bus 116 communicates various
information between the host and CAM engine, including information regarding
the setup and configuration of tables, keys and associations.

The remaining external signals connected between the host signal processor
104 and the CAM engine 100 at the "CAM Port" include various unidirectional
and bi-directional input and output ("I/O") signals (e.g., "CS/", "RD/", "WR/",
"RDY/", "RD_INT/", "WR_INT|BUSY/") that implement various standard
control functions associated with an "off-the-shelf" microprocessor, such as chip
select, memory read, memory write, ready, read interrupt, and a selectable write
interrupt or write busy signal. A clock signal, "CLK", and a master reset signal,
"MSRT/", are also provided as inputs to the CAM engine 100 from other system
components 112 connected with the CAM engine.

The CAM engine 100 connects to one or more RAM devices 108 through
various external signals illustrated in FIGS. 1 and 2 as the "RAM Port". These
signals include a 21-bit address bus 124, "A0-A20", a 64-bit, bi-directional memory
data bus 128, "M0-M63", and various control signals (e.g., "CKE", "CS/", "RAS/",
"CAS/", "WE/") that implement standard control functions associated with
"off-the-shelf" RAM devices, such as clock activation, chip select, row address select,
column address select and write enable.

All of the I/O signals communicated between the CAM engine 100 and the
host 104 are buffered in separate input and output FIFOs 132, 136 (i.e., "first-in,
first-out" buffers or registers) internal to the CAM engine integrated circuit 100.
The FIFOs 132, 136 allow the host 104 to communicate commands and data with
the CAM engine while the CAM engine simultaneously performs other operations.
Thus, the different system components can operate at different speeds from the
CAM engine 100. This parallel execution allows for a pipelined system
architecture, thereby achieving maximum throughput from the CAM engine 100
without the CAM engine causing a "bottleneck" for the overall
microprocessor-based system, as is often the case with RAM devices operating in their standard,
location-addressed manner.
The input FIFO 132 buffers commands and data written to the CAM engine
100 by the host processor 104. The commands are executed by the CAM engine
in the order presented. The resulting status and data are presented in the output
FIFO 136 where they can be read by the host. The FIFOs 132, 136 operate
asynchronously with the rest of the functional blocks of the CAM engine 100,
described hereinafter. This allows the host 104 to communicate with the CAM
engine at a slower, more convenient clock rate, if desired, while the CAM engine
operates at 100 MHz. Alternatively, the host processor 104 can operate on the
same clock and at the same speed as the CAM engine.

The data bus 116 connected between the host 104 and CAM engine 100 can
be configured as either two separate independent 32-bit input and output buses, or
as a single bi-directional 64-bit bus. Regardless of the configuration chosen, the
RAM 108 is configured in 64-bit data words, in an exemplary embodiment. The
bus configuration selection is presented to the CAM engine 100 with the "initialize
memory" command from the host processor 104. Upon power up, the CAM
engine 100 is initially configured for independent 32-bit buses. Sixty-four-bit
architectures present the 32-bit "initialize memory" command in the low order bits
which map to the input bus while the CAM engine is configured for independent
32-bit input and output buses.

The CAM engine 100 supports polling or interrupt-driven architectures for
the host 104. This allows for flexibility to optimize the host interface architecture
and to maximize the throughput of the CAM engine. For polling, the CAM
engine can be polled by reading its internal status register. One of the bits
returned indicates if there is a completed result in the output FIFO 136. If so,
other bits indicate whether the operation was successful or not. Still other bits
report an error code if there was an error. In the case of a "seek" command (upon
which the CAM engine seeks a key and corresponding association stored in RAM
108 based upon a key presented by the host to the CAM engine), the status
register can be polled until it indicates completion. If the status bit indicates
success, then a "read data" command can be executed by the CAM engine to
return the association to the host 104.

On the other hand, an interrupt pin ("RD_INT/") on the CAM engine 100
can be used in interrupt-driven architectures to cause an interrupt to occur when
output data becomes available in the output FIFO 136. This can be either a pulse
or a level signal. After the host responds to the interrupt with a read, the CAM
engine automatically resets the interrupt.

The binary logic states of the signal lines of the four-bit control bus 120,
AC0-AC3, define fourteen separate commands communicated by the host signal
processor 104 to the input FIFO 132 of the CAM engine 100, in a preferred
exemplary embodiment. Twelve of these commands define RAM setup and
management operations carried out by the CAM engine. The remaining two
commands allow the host processor 104 to retrieve information from the output
FIFO 136 of the CAM engine.

The fourteen commands include "initialize memory", "configure new table",
"set hierarchy", "set context to existing table", "delete table", "unload table",
"load table", "count records", "add record", "delete record", "seek (exact
match)", "seek (proximity match)", "read status" and "read data".

The first command the host signal processor 104 sends to the CAM engine
100 is the "initialize memory" command. This command includes information that
identifies the type (i.e., SDRAM or SRAM) and size (i.e., number of 64-bit
words) of interfaced RAM 108 available to the CAM engine. This information is
contained within certain bits of a word on the data bus 116. Other information on
the data bus 116 associated with this command includes the number of SDRAM
banks, the number of tables allowed, the bus width (i.e., separate 32-bit input and
output buses, or a single, 64-bit, bi-directional bus), and the interrupt type (i.e.,
pulse or level). The overall size of RAM 108 connected to the CAM engine 100
can be any power of two from 2^10 to 2^32 64-bit words. This gives the CAM
engine the flexibility to drive from 8k bytes to 256 gigabits of RAM.

When the CAM engine 100 receives the "initialize memory" command, the
initialization function 140 illustrated in FIG. 1 is invoked. This function 140 is
essentially an algorithm or sequence of steps carried out by signal processing
circuitry within the CAM engine integrated circuit 100. Some of the specific
exemplary tasks implemented in the initialization function 140 include allocating
an area in memory for storing table configuration data and zeroing these memory
locations in RAM 108. Also, all "on-chip" CAM engine configuration variables
are cleared and the control data are set up depending upon the number of
permitted RAM configurations.

For example, RAM 108 may be configured in either a single-bank or a two-
bank configuration. In a two-bank configuration, the first two words of the
configuration area are set to contain the size of the free block in the upper 32 bits,
while a zero forward pointer is set, in the lower 32 bits, to indicate no more free
blocks in the bank. On the other hand, a single-bank configuration may be chosen
when external memory does not support multiple banks, or for deep table
configurations with large associations, or when all of the tables are linked
associations. Depending upon the table configuration chosen, more RAM storage
might be usable in a single-bank configuration. In a single-bank configuration, the
entire available RAM 108 is configured as bank zero, while the bank one free
pointer is set to zero to indicate there is no free space in bank one. Within either a
single-bank or two-bank configuration of RAM 108, various table configurations
are possible.

In embodiments of the CAM engine 100 that support only a single table
configuration, the configuration function 144, illustrated in FIG. 1, is invoked
upon completion of the initialization function 140. This function sets the CAM
engine to a predetermined configuration. In contrast, for embodiments of the
CAM engine that support multiple table configurations, the configuration function
144 is invoked as a result of a specific configuration request from the host 104.

For example, if the host 104 sends a "configure new table" command to the
CAM engine 100, the configuration function 144 uses the executive function 148
of FIG. 1 to check if the new table specified is already in use. If so, this is an error
that is reported to the executive function 148. If the specified new table is not
already in use, the new table configuration is set up using the contents of certain
bits of the data bus 116 specified by the host 104. Specifically initialized are
various local variables, such as the depth of the new table, the key and association
widths, the hash factor, the prime delta, the hash shift, and the base addresses for
the key storage and association storage.

The CAM engine 100 may implement various possible table structures in
RAM 108. In a preferred exemplary embodiment, the CAM engine, upon request
from the host 104, may partition RAM into a single CAM table, or into as many as
8096 independent, uniquely-configured CAM tables. The tables may have
different record capacities. This large number of independent tables supports
multiple software application processes and allows a single process to manipulate
multiple tables. The resulting depth, width and number of tables are dependent
upon the total size of RAM 108 and the ultimate user configuration.

The tables store both keys and associations. Regardless of table size, each
table has a programmable key width which may vary from 2 to 32 bytes (256 bits)
in length. The key width is specified when a new table is configured. Different
CAM tables can have different key widths, giving application designers
considerable flexibility. The association word width is also programmable from 1
byte to 8 megabytes (up to 2^20 64-bit words). The association width is also
specified when a new table is configured. Different CAM tables can be configured
with different association widths. Any combination of key widths, association
widths and table sizes may be utilized within the constraints of the RAM devices
108 chosen to interface with the CAM engine 100 of the present invention.

The 8096 tables can be independently configured to 360 different
combinations of key widths and association widths. Tables can be sized to 15
different depths with record capacities ranging from as few as 64 records to as
many as 32 million records. The record depths allocated for a given table are
expressed as an exponent of two. Tables can be sized from 2^6 to 2^25 records. The
key width, association width and record capacity table parameters are specified by
the logic states of certain bits within a 64-bit data word transmitted over the data
bus 116 to the CAM engine with the "configure new table" command.

The RAM storage layout is determined based upon the key width, 
association width, and an "extra wide table" Boolean. The "configure new table" 
command causes the CAM engine to automatically configure RAM storage for 
optimal performance. The algorithm for selecting the record layout, discussed 
hereinafter, prevents the CAM engine from unnecessarily reading associations in 
its burst read search through consecutive words containing keys. The general rule 
for determining the number of 64-bit RAM words required for each record is 
shown in the following pseudo code snippet: 

if(ExtraWideTable)WordsPerRecord = 2^*^^^^"* + ceiling(KeyWidth / 8); 

else if((KeyWidth + AssocWidth) < 9)WordsPerRecord = 1;

else if(KeyWidth < 9)WordsPerRecord = 1 + ceiling(AssocWidth / 8);

else if((KeyWidth + AssocWidth) < 17)WordsPerRecord = 2;

else if(KeyWidth < 17)WordsPerRecord = 2 + ceiling(AssocWidth / 8);

else if((KeyWidth + AssocWidth) < 33)WordsPerRecord = 4;

else WordsPerRecord = 4 + ceiling(AssocWidth / 8);
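
For normal (not extra-wide) tables, the rule above can be written as a small C function. This is a minimal sketch that follows the pseudo code literally; the names words_per_record and words_for_bytes are introduced here for illustration, widths are taken in bytes, and the extra-wide branch is omitted.

```c
#include <assert.h>

/* ceiling(bytes / 8): number of 64-bit words needed to hold `bytes` bytes */
unsigned words_for_bytes(unsigned bytes)
{
    return (bytes + 7u) / 8u;
}

/* 64-bit words per record for a normal table, widths given in bytes */
unsigned words_per_record(unsigned key_width, unsigned assoc_width)
{
    assert(key_width >= 1 && key_width <= 32 && assoc_width <= 32);
    if (key_width + assoc_width < 9)
        return 1;                                   /* key and association share one word */
    if (key_width < 9)
        return 1 + words_for_bytes(assoc_width);    /* one key word, association stored separately */
    if (key_width + assoc_width < 17)
        return 2;                                   /* both fit in two words */
    if (key_width < 17)
        return 2 + words_for_bytes(assoc_width);
    if (key_width + assoc_width < 33)
        return 4;                                   /* both fit in four words */
    return 4 + words_for_bytes(assoc_width);
}
```

For instance, a 6-byte key with a 2-byte association occupies a single word, while a 16-byte key with a 32-byte association needs two key words plus four association words.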

Generally, application performance is a function of the packing density of 
the tables. It is recommended that tables be limited to seventy-five percent 
packing density to maintain good performance. All totaled, there are 6480 
different CAM table configurations possible with the preferred exemplary 
embodiment of the CAM engine 100 of the present invention. This flexibility 
allows for development of sophisticated content addressable databases. 

The CAM engine 100 may create virtual CAM tables within RAM 108.
This feature allows software applications to utilize the interfaced RAM for
multiple purposes and to manage RAM resources dynamically. The ability to
scale applications is also an important flexibility of the CAM engine. The
performance characteristics of tables with different record capacities are virtually
identical. Thus, an application can respond to increases in data volume without
impacting CAM engine performance. This allows for new generations of CAM
applications to maximize reuse of previous architectures.

The CAM engine 100 supports two types of RAM table widths: normal
tables and extra-wide tables. The types differ in the way associations are
represented and the way tables are internally managed. A Boolean indicator,
presented by the host 104 as part of the 64-bit data word in conjunction with the
"configure new table" command, identifies the table type. The valid association
widths for normal tables range from 0 to 32 bytes. One of these 33 widths must
be specified when a new normal table is configured.

Validation lists are defined when an association width of zero is specified.
In the case of such tables, no association is returned by the CAM engine 100 in
response to a "seek" command from the host 104. Instead, only the status bit is
set to indicate if the key exists in the table.

The valid association widths for extra-wide tables range from 0 to 23. This
number represents an exponent of 2 that indicates the number of bytes in the
association. Thus, the possible association widths for extra-wide tables range
from 2^0 through 2^23 bytes. One of these 24 widths must be specified when an
extra-wide table is configured. The distinguishing characteristic of extra-wide
tables is that the association storage is not acquired until a record is added and is
released for possible use by other tables when a record is deleted. While extra-
wide table configurations are a little slower than normal tables, they are especially
useful when associations are very large, such as a bit map.

For a "configure new table" command, the CAM engine also configures the
table depth in terms of number of records. The table depth may take on one of 18
different values, ranging from a record capacity of approximately 64 records up to
32 million records. The RAM layout or configuration is determined by the
specified key width and association width. A configuration is automatically
selected to minimize the number of key words that need to be read for a seek.
This can be adjusted for the word width employed in the particular embodiment of
the invention.

When the CAM engine receives a "configure new table" command from the
host 104, the key and association widths requested are checked for validity. Next,
if an extra wide table is requested, a table is configured for 1, 2 or 4 words of
primary table record width after allowing for the key and a 32-bit linked
association pointer. An area just large enough for the association is allocated as
each record is added, and de-allocated as the record is deleted. Thus, only the
amount of memory required to actually store associations is used. This is the
concept of linked associations, several examples of which are illustrated in FIG. 3.

When the key and association fit together in 1, 2 or 4 words, without
increasing the number of words read to search for the key, then the key and
association share the primary table area record. This is the concept of embedded
associations.

If an extra wide table is not requested and if the key and association fit into
a single word, then a single word embedded association table type will be selected
and both the key and association will be stored in the primary table area.
Otherwise, if the key fits in a single word, then the key is stored in a single word
per record in the primary table area and the association will be stored in 1, 2 or 4
words (as needed) in the secondary table area. If two banks of RAM are
employed, then an attempt is made to place this secondary table area in the second
RAM bank. This allows the appropriate row in the second table to be primed
while the key is searched for in the first bank. This is the concept of relative
associations, as illustrated in FIG. 4.

If neither of these cases applies, the same rules are applied for 2 and 4-
word key widths until the optimum table configuration is selected.

Other local variables initialized upon receipt of the "configure new table"
command include the hash factor, which is the number of key bits per hash
iteration. This variable is set from the size ROM ("Read Only Memory") table
indexed by table size. The hash algorithm is described in detail hereinafter. The
prime delta variable is similar in that it is also set from the size ROM table indexed
by table size. This variable indicates whether the hash function's divisor is a
power of two minus 1 or minus 3. The hash shift variable indicates the number of
bits to shift the hash left to index the key in RAM. If the key width is greater than
3, the hash shift variable equals zero. Otherwise, the hash shift equals the key width.

For example, for table sizes beginning at 0 and incrementing by one until 14
(as indicated by certain bits in the data word on the data bus 116), the
corresponding hash factor is 7, 9, 10, 12, 13, 14, 17, 19, 20, 21, 22, 23, 24, 25
and 26, respectively, while the prime delta is 1, 3, 3, 3, 1, 3, 1, 1, 3, 3, 3, 1, 3, 3,
and 3, respectively.
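
The size ROM lookup can be pictured as two small constant arrays indexed by table size. The sketch below is an illustration only: the array and function names are assumptions, while the values are the ones listed above. It also derives the hash divisor, 2 raised to the hash factor minus the prime delta, for a given table size.

```c
#include <assert.h>
#include <stdint.h>

#define NUM_TABLE_SIZES 15

/* Values as listed in the text for table sizes 0 through 14 */
const uint8_t HASH_FACTOR[NUM_TABLE_SIZES] =
    { 7, 9, 10, 12, 13, 14, 17, 19, 20, 21, 22, 23, 24, 25, 26 };
const uint8_t PRIME_DELTA[NUM_TABLE_SIZES] =
    { 1, 3, 3, 3, 1, 3, 1, 1, 3, 3, 3, 1, 3, 3, 3 };

/* Divisor used by the modulus hash: 2^HashFactor - PrimeDelta */
uint32_t hash_divisor(unsigned table_size)
{
    assert(table_size < NUM_TABLE_SIZES);
    return ((uint32_t)1 << HASH_FACTOR[table_size]) - PRIME_DELTA[table_size];
}
```

Under these values, table size 0 hashes modulo 127 and table size 4 hashes modulo 8191.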

The key base register describes the base address for storage of the key in
RAM, while the association base register describes the base address for storage of
the association in RAM. The key base and association base are both read from the
configuration section for a reconfiguration, or are set after memory allocation for
a new configuration.

The configuration data for the first few tables are kept in registers. The
number of such tables depends upon the particular implementation, but eight is a
likely number. The purpose of this is to allow rapid context switching between
the most frequently used tables. The configuration data for the remaining tables
are kept in the off-chip memory area allocated for this purpose by the "initialize
memory" command.

When the CAM engine 100 receives the "configure new table" command,
the key and association storage space is allocated. Initially, the executive function
148 of FIG. 1 is invoked to allocate 64 + 2^(HashFactor + PrimaryTableWidth) words of
storage, where 2^PrimaryTableWidth is the number of words of primary table area storage
per record. The primary table area stores the key and pointer for linked
association table types, the key and association for embedded association table
types, and the key alone for relative association table types. For relative
association table types, the executive function 148 of FIG. 1 is further invoked to
allocate 64 + 2^(HashFactor + AssocWidth) words of storage, where 2^AssocWidth is the
number of words of storage required for the association. The additional 64
words in either case are included to accommodate a potential hash heap extending
beyond the greatest hash address for the given table size. The number of words
needed to store the primary table records is calculated by the following C code
snippet:

PrimaryTableStorage = 1 << (HashFactor + PrimaryTableWidth);

The primary table storage is always requested from bank zero. The
executive function 148 will allocate space in bank zero if possible. If no bank
zero space is available, but space is available in bank one, the executive function
will allocate space in that bank without returning an error. If the executive
function indicates success, the register "key base" is set to the address returned.
Otherwise, the configuration is terminated.

If the table is a relative association type (FIG. 4), the above process is
repeated for the association storage. This time, RAM is requested from bank one,
but bank zero might be substituted if necessary. The calculation of association
storage is accomplished in the following C code snippet:

WordsOfAssocStorage = 64 + (1 << (HashFactor + AssocWidth));

If the executive function 148 indicates success, the register "association
base" is set to the address returned. Otherwise, the configuration is terminated.
Both the key base and the association base are stored in the first word of the
context entry for the table. A one is stored in the first word of the key storage.
The remaining words are initialized to zero.
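
The two storage requests made during table configuration share the same shape: a power-of-two region plus a 64-word margin for the hash heap. A minimal sketch follows, assuming that both the primary table width and the association width are passed as base-2 exponents of a word count; the function name table_storage_words and the macro HASH_HEAP_MARGIN are introduced here for illustration.

```c
#include <assert.h>
#include <stdint.h>

#define HASH_HEAP_MARGIN 64u   /* extra words for a hash heap past the last hash address */

/* Words to request for a primary or association area:
   64 + 2^(hash_factor + width_log2) */
uint64_t table_storage_words(unsigned hash_factor, unsigned width_log2)
{
    assert(hash_factor + width_log2 < 63);
    return HASH_HEAP_MARGIN + ((uint64_t)1 << (hash_factor + width_log2));
}
```

With a hash factor of 7 and one word per record (exponent 0), the request is 64 + 128 = 192 words.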

When the CAM engine 100 receives a "set context to existing table"
command from the host signal processor 104, the CAM engine identifies the RAM
table which currently has context. Generally, this command must be performed
after the "configure new table" command and before any record operations can be
performed on that table. Also, this command must be called to change the context
to perform record operations on a different table.

Upon receiving the "set context to existing table" command, the CAM
engine checks whether the table number specified exceeds the valid limit for the
particular embodiment. If so, the executive function 148 in FIG. 1 is invoked to
report an error and the command processing is terminated. If not, the executive
function 148 is invoked to read the two configuration words stored in RAM or
registers into global registers. If the contents of the second word are zero,
then the table specified on the data bus has not been previously set. If so, the
executive function is invoked to report an error and the command processing is
terminated. Otherwise, the configuration function 144, described above with
respect to the "configure new table" command, with regard to the initialization of
local variables, is followed. However, the local variables "key base" and
"association base" are set from the first word read from the CAM table's
configuration records.

The CAM engine 100 responds to a command from the host signal
processor 104 to "set hierarchy". The CAM engine defines a parent/child
relationship between two tables, implemented in RAM, in preparation for
hierarchical searches. The hierarchical search capability allows tables with
different key lengths to be linked hierarchically and searched in sequence for the
most significant bytes of the key. This feature also allows tables to be dynamically
expanded if a table overflow occurs. In this case, the same key lengths are used
for both the parent and the child tables.

The CAM engine 100 implements a simple hierarchical search capability
which supports a longest prefix match. A CAM table implemented in RAM 108
(i.e., a "parent" table) may optionally be designated as having another specific
table as a "child". Association searches which fail to find an exact match for the
key in the parent table automatically cascade to their child table, if one has been
designated.

FIG. 5 illustrates an example of this concept using three tables, each table
having 128-bit association widths, table 1 having an 80-bit key width, table 2
having a 48-bit key width, and table 3 having a 24-bit key width. The key being
searched for is automatically masked to the length of the key of the child table,
thereby preserving the high order bytes of the key. The child table (table 2) is
searched and, if an exact match is not found and the child table has another
subsequent child table (table 3), the CAM engine automatically cascades to the
new child table and the search process is repeated. As soon as a match is found in
a child table, the corresponding association is returned. If an exact match cannot
be found in any hierarchical tables, the CAM engine will return a no-match status.
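
The cascade can be sketched in C as a walk down the parent/child chain, comparing only the high-order bytes that each table's key width covers. This is an illustration under stated assumptions: keys are stored most significant byte first, a linear scan stands in for the hashed seek, the association is reduced to an int, and the type and function names are hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define MAX_KEY_BYTES 32

struct record {
    unsigned char key[MAX_KEY_BYTES];  /* most significant byte first */
    int assoc;                         /* stand-in for the stored association */
};

struct table {
    size_t key_len;                    /* key width in bytes */
    const struct record *recs;
    size_t count;
    const struct table *child;         /* NULL when no child is designated */
};

/* Returns 1 and stores the association on a match, 0 for no-match. */
int hierarchical_seek(const struct table *t, const unsigned char *key,
                      size_t key_len, int *assoc_out)
{
    assert(t != NULL && key != NULL && assoc_out != NULL);
    for (; t != NULL; t = t->child) {
        if (t->key_len > key_len)
            continue;                  /* presented key is too short for this table */
        for (size_t i = 0; i < t->count; i++) {
            /* Masking to the table's key length: compare high-order bytes only */
            if (memcmp(t->recs[i].key, key, t->key_len) == 0) {
                *assoc_out = t->recs[i].assoc;
                return 1;
            }
        }
    }
    return 0;
}
```

With tables of 10-byte, 6-byte and 3-byte keys chained as in FIG. 5, a 10-byte key that misses in the first table is retried against its first 6 bytes and then its first 3 bytes.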

This hierarchical table structure and search capability is especially useful for
telecommunications applications. Considerable time is saved because the CAM
engine does not have to pass the no-match condition out to the host processor,
the host processor does not have to decide what to do with such condition, and
the host does not have to present a context switch and the key back to the CAM
engine.

In addition to supporting a prefix match, the hierarchy feature also provides
the general data management function of table overflow control. Sometimes, data
capture applications cannot properly size a table to handle all possible
circumstances. When an application determines that a table has reached its
storage capacity, a new table with the same key and association lengths can be
created and set as hierarchically subservient to the original table. New records can
be added to the newly-created table and seeks can continue to be performed
against the original table. At a slight cost in performance, the application can
continue to run without any awareness that a table has gone into overflow mode.
Thus, overflow conditions are handled transparently. This is in contrast to
conventional CAMs, where if a table fills up, records must be removed before a
new record can be added. FIG. 6 illustrates this concept of table overflow, in
which the original table, table 1, has overflowed into table 2.

The "set hierarchy" command is optional, and can be performed at any time
after the parent and child tables have both been configured. Also, it can be
performed at any time to re-define a parent/child relationship. The command has
no effect until context is set to a table defined as a parent.

The CAM engine configuration function 144 of FIG. 1 responds to a "delete
table" command from the host 104. For this command, the CAM engine 100
simply deletes the entire table specified, along with all of its records. If the table is
an extra wide table, then all linked associations are deleted. Note, this command
is different from the "delete record" command, discussed hereinafter.

The CAM engine 100 responds to a "count records" command from the
host 104. This feature helps the software application determine if a table is
full and whether it needs to take corrective action. The CAM engine counts the
number of records in the table which currently has context, and returns the count
to the host. In taking the corrective action, the application might delete old
records, create an overflow table or restructure or re-size the table. This might be
useful for periodic monitoring of CAM tables to determine if a table is becoming
full and if corrective action should be taken. This command may be performed
any time after the appropriate table has had context set to it.

The CAM engine 100 of the present invention implements table load and
unload features in response to "load table" and "unload table" commands,
respectively, from the host 104. These features work together with software
database programs to support database maintenance functions such as backup and
recovery, and table and database restructuring.

For example, it is often desirable to periodically back up a table. Also, it is
desirable to restore a backed-up table without having to perform successive add-
record operations. It may also be desirable for some applications to initialize a
table to a previously determined set of values. The table load and unload features
allow applications to easily load and unload CAM tables to and from RAM 108.

Also, tables for entire databases can be restructured to de-fragment memory
or to combine tables which have gone into overflow. The CAM engine 100
supports these activities by allowing tables to be unloaded from RAM 108 to the
host 104, then manipulated by a database utility program, and reloaded into RAM
back from the host. These two commands can be performed any time after the
appropriate table has had context set to it.

If an "unload table" command is requested, the CAM engine will first output
its configuration word to the data bus. Then, beginning with word zero, the CAM
engine will sequentially output each RAM word within the range of the
configuration. A handshake with the application will accompany each word. A
time-out waiting for a handshake from the application will result in an error and
terminate the unload. Upon completion of the unload, the CAM engine will
invoke the executive function 148 to set the status bit to indicate that it was
successful and that the unload was complete.

If a "load table" command is requested, the configuration must exactly
match the configuration of the table previously unloaded. The current
configuration word is compared with the word on the data bus. If they are not
equal, the executive function 148 will be initiated to flag the error and terminate
the load. Otherwise, the CAM engine will store each word passed in the load in
consecutive words of the current table. The handshake and time-out logic is the
same as for the unload. If a time-out occurs during a load, the table is reinitialized.
When successfully completed, the original contents of the table are entirely
replaced by the load and the CAM engine 100 will invoke the executive function
148 to set the status bit to indicate that it was successful and that the load was
complete.

Similar to the other functional blocks illustrated in FIG. 1 as being part of
the CAM engine 100 of the present invention, the executive function 148
comprises signal processing circuitry within the CAM engine integrated circuit.
The executive function performs such tasks as RAM management, control and
refresh, data bus management, status and error reporting, and performance
monitoring. The RAM management function allocates free RAM space and
returns deleted table space to the RAM free space pool. The requesting function
specifies the number of words required and provides a Boolean variable indicating
which RAM bank is the first choice. RAM allocation is performed using a simple
first fit algorithm. The chain pointer for the bank requested is walked until a free
block large enough to accommodate the request is found. If such a block does not
exist in the bank requested, then allocation from the other bank is attempted. If
neither bank has sufficient free space, an error is reported. On the other hand,
RAM release is performed by absorbing the released block into free blocks on
either side of it, if possible, or by adding the block to the free chain. In either
case, RAM is not zeroed when released. It is zeroed when allocated for key
portions only.
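
A first fit walk over the free chain, with fallback to the other bank, might look like the following C sketch. It is a simplified model rather than the circuit's actual behavior: it assumes each free block starts with a header word holding the block size in the upper 32 bits and the forward pointer in the lower 32 bits, treats word address 0 as both "end of chain" and "allocation failed", carves requests from the tail of a free block, and omits release and coalescing. All names are hypothetical.

```c
#include <assert.h>
#include <stdint.h>

#define NO_BLOCK 0u   /* a zero forward pointer ends the chain */

uint64_t make_header(uint32_t size, uint32_t next) { return ((uint64_t)size << 32) | next; }
uint32_t hdr_size(uint64_t h) { return (uint32_t)(h >> 32); }
uint32_t hdr_next(uint64_t h) { return (uint32_t)h; }

/* Walk one bank's chain and take the first block that is large enough. */
uint32_t alloc_from_chain(uint64_t *ram, uint32_t *head, uint32_t words)
{
    uint32_t prev = NO_BLOCK;
    for (uint32_t cur = *head; cur != NO_BLOCK; prev = cur, cur = hdr_next(ram[cur])) {
        uint32_t size = hdr_size(ram[cur]);
        uint32_t next = hdr_next(ram[cur]);
        if (size < words)
            continue;                              /* too small, keep walking */
        if (size == words) {                       /* exact fit: unlink the block */
            if (prev == NO_BLOCK)
                *head = next;
            else
                ram[prev] = make_header(hdr_size(ram[prev]), next);
            return cur;
        }
        ram[cur] = make_header(size - words, next); /* shrink block, hand out its tail */
        return cur + (size - words);
    }
    return NO_BLOCK;
}

/* Try the preferred bank first, then the other bank. */
uint32_t cam_alloc(uint64_t *ram, uint32_t heads[2], uint32_t words, int first_bank)
{
    assert(words > 0 && (first_bank == 0 || first_bank == 1));
    uint32_t addr = alloc_from_chain(ram, &heads[first_bank], words);
    if (addr == NO_BLOCK)
        addr = alloc_from_chain(ram, &heads[1 - first_bank], words);
    return addr;   /* NO_BLOCK here means neither bank had room */
}
```

Carving from the tail keeps the free block's header in place, so a partial allocation needs no relinking; only an exact fit changes the chain.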

The RAM control function provides RAM read and write services for all
other CAM engine processes. The address, a read/write flag, and burst length are
provided when the service is requested. The RAM control function also handles
requests to terminate a burst read. A gate is set by the function while it has
control of the RAM. If a RAM refresh is underway, the RAM control function
will wait.

The RAM refresh function counts clock cycles and, depending upon the
clock speed, initiates a standard distributed CBR refresh as soon as the RAM is
not busy and the appropriate number of cycles have passed. This executive
process is performed in parallel with all other CAM engine processes and
communicates via a gate with the RAM control function to show that the RAM is
busy during the refresh cycle.

The executive function 148 of FIG. 1 also controls all chip I/Os. This
includes the I/O through the bi-directional host data bus 116 and the bi-directional
memory bus 128.

All host write commands are buffered through the input FIFO 132 by a
FIFO manager, which is part of the executive function 148. From the input FIFO,
the FIFO manager initiates modules to perform specific tasks. Each module
returns a status to the report status module of the executive function 148 and
returns data into the output FIFO 136 along with the status data provided by the
report status module. The FIFO manager responds to the host read commands by
providing status and data from the output FIFO 136 onto the host data bus 116.

The CAM engine 100 of the present invention implements a hash algorithm
to determine the optimum location in RAM 108 to store data and to determine
where data are stored in RAM. The hash algorithm hashes keys presented on the
data bus 116 by the host 104 to the CAM engine. Key hashing is performed, for
example, in response to the "seek (exact match)", the "delete" and the "add
record" commands, described hereinafter. The hash algorithm is also used to
rehash keys read from RAM to determine the depth of consecutive words in RAM
that share the hash code of interest.

In a preferred exemplary embodiment, the CAM engine 100 of the present
invention employs various different hash algorithms, including a base 10 modulus
hash and several fast hash algorithms. The hash algorithm selected must be
specified when a table is configured and is used for all hash interactions for that
table. This allows a user to select the hash algorithm which performs best with a
particular set of data. While the full modulus hash (i.e., base 10 hash) algorithm
takes longer to calculate than the fast hash algorithms, it may produce a better
distribution.

The base 10 modulus hash function generates a hash code by reducing the key
modulo a prime or "near prime" number just slightly smaller than the address
space of RAM being addressed. The hash code can be calculated using the
following identity, which holds for integers KEY, N and PD where 2^N > PD:

$$\mathrm{KEY} \bmod (2^{N} - \mathrm{PD}) = \bigl( (\mathrm{KEY} \bmod 2^{N}) + \lfloor \mathrm{KEY} / 2^{N} \rfloor \cdot \mathrm{PD} \bigr) \bmod (2^{N} - \mathrm{PD})$$

This property allows the normally time consuming modulus function to be
simplified into a number of shifts, additions and a subtraction. N and PD are
constants set upon initialization of RAM. N is selected so that 2^N equals the
number of target addresses available. For the address spaces of interest, PD = 3
or PD = 1 were selected because for these address spaces, either 2^N - 3 or 2^N - 1
has the desirable characteristic of being either prime or having its smallest prime
factor > 20. Numbers with a prime factor of less than 20 are generally not suitable
as the divisor for a hash modulus. The table of FIG. 7 illustrates the divisor of
choice for the address spaces 2^N supported by the CAM engine. PD represents the
prime delta used by the modulus hash function.
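The divisor criterion described above (prime, or smallest prime factor greater than 20) can be checked mechanically. The following is a minimal sketch under stated assumptions: the function names `smallest_prime_factor` and `is_suitable_divisor` are hypothetical and do not appear in the specification, and trial division is used only because the divisors of interest fit in 32 bits.

```cpp
#include <cassert>
#include <cstdint>

// Smallest prime factor of n (n >= 2), found by trial division.
std::uint64_t smallest_prime_factor(std::uint64_t n) {
    assert(n >= 2);
    for (std::uint64_t d = 2; d * d <= n; ++d) {
        if (n % d == 0) return d;
    }
    return n;  // no divisor up to sqrt(n), so n itself is prime
}

// True if 2^address_bits - pd is prime or has no prime factor of 20 or less.
// Hypothetical helper; the threshold of 20 is taken from the text above.
bool is_suitable_divisor(unsigned address_bits, std::uint64_t pd) {
    assert(address_bits >= 3 && address_bits <= 32);
    const std::uint64_t divisor = (std::uint64_t{1} << address_bits) - pd;
    const std::uint64_t spf = smallest_prime_factor(divisor);
    return spf == divisor || spf > 20;
}
```

For example, `is_suitable_divisor(13, 1)` holds because 2^13 - 1 = 8191 is prime, whereas `is_suitable_divisor(16, 1)` fails because 2^16 - 1 = 65535 is divisible by 3.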

The following C++ code snippet illustrates how the hash would be performed
for a 64-bit key if high level functions were utilized:

    void CCAMDig::OnHashHighLevel(unsigned __int64 key, unsigned __int32 addressBits)
    {
        unsigned __int32 hash;

        // Divisor is 2^addressBits - PD; integer arithmetic keeps % well defined.
        hash = (unsigned __int32)(key % (((unsigned __int64)1 << addressBits) - PD));
    }

The following C++ code snippet more closely represents the hardware
implementation of the high speed modulus algorithm for a 64-bit key using only
low level functions:

    void CCAMDig::OnLowLevel(unsigned __int64 key, unsigned __int32 addressBits)
    {
        unsigned __int64 temp;
        unsigned __int32 hash, divisor;
        int highBits, is;

        highBits = 64 - addressBits;
        if (PD == 3) is = 2; else is = 1;

        // Mask of addressBits ones with its low bits cleared, plus one:
        // 2^addressBits - PD for PD = 3 or PD = 1.
        divisor = (unsigned __int32)(((0xFFFFFFFFFFFFFFFF << highBits >> highBits) >> is << is) + 1);

        // Each step folds the high bits back in as low + 3 * high (PD = 3).
        temp = (key << highBits >> highBits) + (key >> addressBits)
             + (key >> addressBits << 1);
        temp = (temp << highBits >> highBits) + (temp >> addressBits)
             + (temp >> addressBits << 1);
        hash = (unsigned __int32)((temp << highBits >> highBits)
             + (temp >> addressBits) + (temp >> addressBits << 1));

        // Final conditional subtraction; for PD = 3 the low two bits minus one
        // equal hash - divisor.
        if (hash >= divisor) hash = (hash & 0x3) - 1;
    }

The final modulus is performed by a single conditional subtraction to
properly compute the modulus for the cases 2^N - 1, 2^N - 2, and 2^N - 3. The actual
hardware implementation is depicted graphically in FIG. 8 for a 64-bit key. FIG. 9
is a table depicting a more detailed representation of the hash algorithm. In FIG.
9, once step 1 has completed processing bit N, step 2 can begin to run in parallel.
Similarly, once bits 0 and 1 of the preliminary hash result are available, they can be
extracted and made ready in case they become the final hash.
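The shift-and-add reduction can also be written as a loop that folds until the value fits in N bits, which makes the role of the single conditional subtraction easier to see. This is a sketch under stated assumptions rather than the hardware implementation: the name `modulus_hash` is hypothetical, N is limited to 3 through 32 bits, and PD is limited to the two values named in the text.

```cpp
#include <cassert>
#include <cstdint>

// Computes key mod (2^n - pd) using only shifts, masks, additions and one
// final conditional subtraction. Assumes 3 <= n <= 32 and pd == 1 or pd == 3.
std::uint32_t modulus_hash(std::uint64_t key, unsigned n, unsigned pd) {
    assert(n >= 3 && n <= 32);
    assert(pd == 1 || pd == 3);
    const std::uint64_t mask = (std::uint64_t{1} << n) - 1;  // low n bits
    const std::uint64_t divisor = mask + 1 - pd;             // 2^n - pd

    std::uint64_t temp = key;
    // Each pass replaces temp = high * 2^n + low by low + pd * high, which is
    // congruent to temp modulo the divisor because 2^n is congruent to pd.
    while (temp > mask) {
        const std::uint64_t low = temp & mask;
        const std::uint64_t high = temp >> n;
        // pd * high as shift-and-add: high, plus 2 * high when pd == 3.
        temp = low + high + (pd == 3 ? (high << 1) : 0);
    }
    // temp < 2^n here, so it exceeds the divisor by less than pd.
    if (temp >= divisor) temp -= divisor;
    return static_cast<std::uint32_t>(temp);
}
```

As a worked case with n = 4 and pd = 3 (divisor 13), the key 100 = 6 * 16 + 4 folds to 4 + 3 * 6 = 22, then 22 = 1 * 16 + 6 folds to 6 + 3 = 9, which equals 100 mod 13.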

For shallow CAM tables and/or long keys, additional parallel processes can
be employed to speed the hash calculation. In these cases, the pattern above is
followed for each successive N bits of the key. The number of parallel processes
that can be effectively utilized is roughly represented by the formula:

    NumParallelProcesses = Integer(NumKeyBits / N)

In the actual silicon implementation of the hash algorithm, logic is employed to
ensure that a process is never overtaken by a chasing process, and additional
parallel processes are automatically initiated as soon as the preceding N bits of the
key have been processed.
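The per-N-bit pattern can be illustrated in software by hashing a wide key one N-bit chunk at a time. The sketch below is sequential, not parallel, and rests on assumptions: the names `num_parallel_processes` and `hash_wide_key` are hypothetical, and the caller is assumed to supply the key already split into N-bit chunks, most significant chunk first.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Rough count of parallel hash processes, per the formula in the text.
unsigned num_parallel_processes(unsigned num_key_bits, unsigned n) {
    assert(n > 0);
    return num_key_bits / n;  // unsigned division truncates
}

// Hashes a wide key supplied as n-bit chunks, most significant chunk first.
// Because 2^n is congruent to pd modulo (2^n - pd), the running value is
// multiplied by pd (shift and add) before each new chunk is added in.
std::uint32_t hash_wide_key(const std::vector<std::uint32_t>& chunks,
                            unsigned n, unsigned pd) {
    assert(n >= 3 && n <= 32);
    assert(pd == 1 || pd == 3);
    const std::uint64_t divisor = (std::uint64_t{1} << n) - pd;
    std::uint64_t h = 0;
    for (std::uint32_t chunk : chunks) {
        assert(chunk < (std::uint64_t{1} << n));
        h = h + (pd == 3 ? (h << 1) : 0) + chunk;  // h * pd + chunk
        while (h >= divisor) h -= divisor;          // only a few subtractions
    }
    return static_cast<std::uint32_t>(h);
}
```

With n = 4 and pd = 3, the chunks {0x6, 0x4} (the key 0x64 = 100) hash to 9, matching 100 mod 13.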

As an alternative to the base 10 modulus hash, several fast hash algorithms
allow the hash to be performed synchronously with each RAM read. These
algorithms are variants of base two modulus functions which differ by the bit
alignment as a series of exclusive-or operations are performed. An unequal
frequency distribution of certain key bits can be compensated for by selecting the
best alignment factor. The following code snippet illustrates the fast hash
algorithm for a 256-bit key. The particular algorithm is determined by the input
ALIGNMENT.

    val = (Key64[0] & KeyMask[0]) ^ (Key64[1] & KeyMask[1])
        ^ (Key64[2] & KeyMask[2]) ^ (Key64[3] & KeyMask[3]);

    hash = (unsigned __int32)((val >> 31) ^ (val << 33 >> 33));

    hash = (hash >> (TableSize - ALIGNMENT))
         ^ (hash << (64 - TableSize) >> (64 - TableSize));

    hash = (hash >> TableSize)
         ^ (hash << (64 - TableSize) >> (64 - TableSize));

    hash = (hash >> TableSize)
         ^ (hash << (64 - TableSize) >> (64 - TableSize));

The fast hash function can hash up to 1024 bits of key in 10 nanoseconds
using modern CMOS technology. The 10 nanosecond figure is critical because it
allows hashing to proceed synchronously, in single clock cycles, at 100 MHz. This
allows for an original hash and for the rehash of every key read, to see if a read
has occurred beyond the end of the possible words where the key could have been
stored. The benefit of this is that it allows the CAM engine to terminate with a no
match condition as quickly as possible without having to store the hash of each
key or use pointers.
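The exclusive-or folding idea behind the fast hash can be shown with a simplified, self-contained function. This is only a sketch under stated assumptions: the name `xor_fold_hash` is hypothetical, the fold order is the simplest one that reduces the value to the table width, and the ALIGNMENT input of the snippet above is not modeled.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// XOR-folds a 256-bit key (four 64-bit words) down to table_bits bits.
// Sketch only: the ALIGNMENT parameter of the original snippet is not modeled.
std::uint32_t xor_fold_hash(const std::array<std::uint64_t, 4>& key,
                            const std::array<std::uint64_t, 4>& mask,
                            unsigned table_bits) {
    assert(table_bits >= 1 && table_bits <= 32);
    // Combine the masked key words into one 64-bit value.
    std::uint64_t val = 0;
    for (std::size_t i = 0; i < key.size(); ++i) val ^= key[i] & mask[i];

    // Fold table_bits at a time: XOR the low bits with the shifted remainder
    // until nothing is left above bit table_bits - 1.
    const std::uint64_t low_mask = (std::uint64_t{1} << table_bits) - 1;
    std::uint64_t hash = val;
    while (hash > low_mask) {
        hash = (hash >> table_bits) ^ (hash & low_mask);
    }
    return static_cast<std::uint32_t>(hash);
}
```

Because each pass uses only shifts, masks and exclusive-or, it maps naturally onto combinational logic, which is consistent with the single-cycle behavior described above.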

The CAM engine 100 also responds to a "seek (exact match)" command
from the host signal processor 104. In response to this command, the CAM
engine attempts to return the stored association of the record indexed by the key
presented by the host in the table which currently has context. In other words, the
CAM engine is looking for an exact identity between the key presented to it and a
key mapped to an association stored in RAM.

In a preferred exemplary embodiment, the CAM engine's seek time is
optimized by utilizing parallel processes not only for the seek function 152 of FIG.
1, but also for the add function 156 and the delete function 160 of FIG. 1, both
discussed hereinafter. By reading, comparing, checking for free words and
checking for the end of the hash heap all in parallel, the CAM engine of the
present invention can sustain a 10 nanosecond per word read rate. This rate can
actually be achieved for very wide word widths.

The parallel processes involve calculating the hash (seek) key and the
segment base address, and reading the key words starting at the address just
calculated. The CAM engine then checks for an exact match between the key
presented and the stored key, while simultaneously performing, in parallel, a
rehash of the key just read and a check for a no match condition. This parallel
seek process continues until either a match condition or a no match condition is
met. When either condition is met, all of the parallel processes are terminated.

The check for no match process checks whether the hash code just calculated by
rehashing the key read from RAM is greater than the hash code calculated from the
target key and, if so, the executive function 148 of FIG. 1 is invoked to set the
status to no match and to terminate the burst read. This test is sufficient because
the add function 156 of FIG. 1 ensures that heaps of keys with common hash codes
are stored contiguously and follow any keys with lesser hash codes.

The check for match process checks whether the words just read match the seek
key and, if so, the executive function 148 is invoked to begin the transfer of
association data, to set the status to match, and to terminate the burst read.
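The match and no match checks can be sketched sequentially in software. The following is an illustration under stated assumptions, not the parallel hardware process: `Slot`, `SeekResult`, `hash_key` and `seek_exact` are hypothetical names, a simple modulus stands in for the configured hash, keys are limited to 64 bits, and a `used` flag stands in for the free-word test.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

struct Slot {
    bool used;                  // false marks a free word
    std::uint64_t key;
    std::uint32_t association;
};

struct SeekResult {
    bool match;
    std::uint32_t association;  // meaningful only when match is true
};

// Stand-in hash for the sketch; the engine would use its configured hash.
std::size_t hash_key(std::uint64_t key, std::size_t table_size) {
    return static_cast<std::size_t>(key % table_size);
}

// Exact-match seek over a table whose records are stored in hash sequence.
SeekResult seek_exact(const std::vector<Slot>& table, std::uint64_t key) {
    assert(!table.empty());
    const std::size_t target = hash_key(key, table.size());
    for (std::size_t addr = target; addr < table.size(); ++addr) {
        const Slot& slot = table[addr];
        if (!slot.used) return SeekResult{false, 0};       // free word: no match
        if (slot.key == key) return SeekResult{true, slot.association};
        // Rehash the key just read; a greater hash code means the heap of
        // keys sharing the target hash code has ended.
        if (hash_key(slot.key, table.size()) > target) return SeekResult{false, 0};
    }
    return SeekResult{false, 0};  // ran off the end of the primary table
}
```

The early exit on a greater rehash value is what allows a no match result to be reported without storing a hash code or a pointer alongside each key.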

The CAM engine 100 of the present invention is designed to leverage the
burst read and write capability of synchronous dynamic RAM ("SDRAM") and of
pipelined static RAM ("SRAM"). SDRAM burst read and write allow consecutive
words in RAM to be read or written at the rate of one word per clock cycle (as
fast as 10 nanoseconds in current SDRAMs). In the CAM engine, the storage
algorithm, as part of the hash algorithm, keeps all hash collisions in contiguous
RAM, thereby allowing burst read to speed a CAM seek and burst write
to push words, if necessary, to make room for newly added hash collisions. This
is a marked improvement over prior art hash implementations that resolve hash
collisions (i.e., when two keys hash to the same number) by providing pointers to
an overflow space. That prior art process incurs relatively long delays (e.g., 50
nanoseconds or longer for SDRAM) spent priming the row address on a DRAM
for each pointer.

If, for a particular application, an identical match between the key presented
and a stored key is not likely, the CAM engine may respond to the "seek
(proximity match)" command from the host in implementing the seek function 152
of FIG. 1. In responding to this command, the CAM engine attempts to return the
association of the record whose key is closest to the key presented by the host 104
in the table which currently has context. The CAM engine implements this
proximity match feature to allow an entire table to be examined to identify the
record which most closely resembles the key presented and to return the key's
association from that record. When performing a proximity match, as many as
fifty million keys can be examined for closeness in one second. This feature is
designed to support real time artificial intelligence ("AI") recognition and learning
applications.

The proximity match feature returns the association from RAM 108 of the
closest matching record in a table. The feature is executed by the CAM engine
100 when an exact match between the presented key and table records cannot be
found. Besides AI applications, the proximity match feature is useful in image
systems and other applications where inexact, or "fuzzy", comparisons are useful
and suitable.

As this proximity match feature is implemented in a preferred exemplary
embodiment of the present invention, the CAM engine supports two separate and
distinct distance formulas. Either formula may be utilized to determine closeness
of the presented key to stored keys. The first formula is a Manhattan distance
formula, implemented as:

$$\sum_{n} \lvert E_n - T_n \rvert$$

The CAM engine also supports the following Euclidean distance formula:

$$\Bigl( \sum_{n} (E_n - T_n)^2 \Bigr)^{1/2}$$

The CAM engine distance formulas operate on dynamic element boundaries. The
chosen boundary applies to the entire key. However, the boundary can be changed
from search to search. The CAM engine supports 4, 8, 16 and 32-bit element
boundaries.
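The two distance formulas and the selectable element boundary can be sketched as follows. Several points are assumptions made for illustration: the function names are hypothetical, keys are held as 64-bit words, elements are treated as unsigned values taken least significant first, and E and T are read as the elements of the two keys being compared.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Splits a key, held as 64-bit words, into unsigned elements of
// element_bits bits each (4, 8, 16 or 32), least significant element first.
std::vector<std::uint32_t> split_elements(const std::vector<std::uint64_t>& key,
                                          unsigned element_bits) {
    assert(element_bits == 4 || element_bits == 8 ||
           element_bits == 16 || element_bits == 32);
    const std::uint64_t mask = (std::uint64_t{1} << element_bits) - 1;
    std::vector<std::uint32_t> elements;
    for (std::uint64_t word : key) {
        for (unsigned shift = 0; shift < 64; shift += element_bits) {
            elements.push_back(static_cast<std::uint32_t>((word >> shift) & mask));
        }
    }
    return elements;
}

// Manhattan distance: sum over n of |E_n - T_n|.
std::uint64_t manhattan_distance(const std::vector<std::uint32_t>& e,
                                 const std::vector<std::uint32_t>& t) {
    assert(e.size() == t.size());
    std::uint64_t sum = 0;
    for (std::size_t n = 0; n < e.size(); ++n) {
        sum += (e[n] > t[n]) ? (e[n] - t[n]) : (t[n] - e[n]);
    }
    return sum;
}

// Euclidean distance: square root of the sum over n of (E_n - T_n)^2.
double euclidean_distance(const std::vector<std::uint32_t>& e,
                          const std::vector<std::uint32_t>& t) {
    assert(e.size() == t.size());
    double sum = 0.0;
    for (std::size_t n = 0; n < e.size(); ++n) {
        const double d = static_cast<double>(e[n]) - static_cast<double>(t[n]);
        sum += d * d;
    }
    return std::sqrt(sum);
}
```

Changing `element_bits` between searches changes how the same key bits are grouped, so the same pair of keys can yield different distances under different boundaries.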

The CAM engine 100 responds to an "add record" command from the host
104 by adding a key and association, presented to the CAM engine by the host, to
the table that currently has context. The add function 156 of FIG. 1 ensures that
all keys are stored in hash sequence either in the word addressed by their hash
code or in the next possible word following the hash address without preceding a
key with a lower hash code. FIG. 10 illustrates an example of the record addition
function.

Similar to the seek function 152 described above, the add function 156
utilizes several processes (e.g., SDRAM burst read) operating in parallel to
provide optimum record addition performance. Generally, only the first read of
the addition process and the first read of a page take more than one clock cycle
per word. In embodiments in which the hashing is performed in one clock cycle
per word, the only wait state occurs when SDRAM is used (as opposed to
SRAM) and a page boundary is crossed. If this happens, control logic ensures
that none of the parallel processes outruns the others.

In implementing this function, a variable Key Word is set by the
configuration function 144 and contains the number of 64-bit words that comprise
the key. If this variable is zero, then no configuration has been established and the
add process will invoke the executive function 148 of FIG. 1 to return an error.
The parallel add process continues until either a free word is found or an end of
hash heap condition is met. When either condition is met, all of the parallel
processes are terminated.
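The hash-sequence rule enforced by the add function can be sketched as an ordered insertion that pushes later words down by one. This is an illustration under stated assumptions: `Slot`, `hash_key` and `add_key` are hypothetical names, a simple modulus stands in for the configured hash, keys are single 64-bit words, duplicate keys are not checked, and overflow tables are not modeled.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

struct Slot {
    bool used;          // false marks a free word
    std::uint64_t key;
};

// Stand-in hash for the sketch; the engine would use its configured hash.
std::size_t hash_key(std::uint64_t key, std::size_t table_size) {
    return static_cast<std::size_t>(key % table_size);
}

// Adds key so that stored keys stay in hash sequence. Returns false when the
// table has no room at or after the insertion point (overflow not modeled).
bool add_key(std::vector<Slot>& table, std::uint64_t key) {
    assert(!table.empty());
    const std::size_t size = table.size();
    const std::size_t target = hash_key(key, size);

    // Insertion point: the first word at or after the hash address that is
    // free or holds a key with a greater hash code.
    std::size_t pos = target;
    while (pos < size && table[pos].used &&
           hash_key(table[pos].key, size) <= target) {
        ++pos;
    }
    if (pos == size) return false;

    // Find the free word that ends the run of used words starting at pos.
    std::size_t free_pos = pos;
    while (free_pos < size && table[free_pos].used) ++free_pos;
    if (free_pos == size) return false;

    // Push the intervening words down by one to open a hole at pos.
    for (std::size_t i = free_pos; i > pos; --i) table[i] = table[i - 1];
    table[pos] = Slot{true, key};
    return true;
}
```

In an eight-word table, adding the keys 10, 3, 18 and 26 in that order leaves 10, 18 and 26 (hash code 2) in words 2 through 4 and pushes 3 (hash code 3) to word 5.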

The add function handles the null key (a key with all zero bits) by reserving
record zero of the table for it. Upon the initial configuration of a table, the
"configure new table" function zeros all words of the primary table area and stores
a one in the last word of the first record of the table. This special initialization
indicates that the first record is free. By reserving this location for the null key
(whether or not one is ever added) the add module is able to distinguish the
difference between a free word and a stored null key. If the key stored in the first
record of the table is all zeros, then the null key has been stored there. If not, then
the word is free for storage of the null key. Keys which hash to zero but are not
null are given a hash result of one, which keeps them from ever being stored in the
special location reserved for the null key. The seek and delete functions have
companion processes which set the hash of keys that hash to zero to one unless
they are the null key. This special processing allows the CAM engine to accept
null keys as valid keys.
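The null key rule can be expressed as a small adjustment applied after hashing. The sketch below makes several assumptions: the names `is_null_key`, `effective_hash` and `mark_first_record_free` are hypothetical, keys are held as 64-bit words, and the first record is modeled as key words only.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// True if every word of the key is zero, i.e. the key is the null key.
bool is_null_key(const std::vector<std::uint64_t>& key_words) {
    for (std::uint64_t w : key_words) {
        if (w != 0) return false;
    }
    return true;
}

// Applies the null key rule: only the null key may use record zero, so a
// non-null key whose raw hash is zero is given a hash result of one.
std::uint32_t effective_hash(const std::vector<std::uint64_t>& key_words,
                             std::uint32_t raw_hash) {
    if (is_null_key(key_words)) return 0;
    return raw_hash == 0 ? 1 : raw_hash;
}

// Initializes record zero as free: all zeros except a one in the last word,
// so a free record can never be mistaken for a stored null key.
void mark_first_record_free(std::vector<std::uint64_t>& first_record) {
    assert(!first_record.empty());
    for (std::uint64_t& w : first_record) w = 0;
    first_record.back() = 1;
}
```

After `mark_first_record_free`, `is_null_key` on record zero reports false until the null key is actually stored there.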

Associations are stored in RAM 108 in different manners, depending on the
configuration chosen and the features supported by the particular embodiment of
the CAM engine 100 implemented, in light of the teachings herein. However,
there are four basic association storage structures: (1) no association; (2)
embedded configurations, in which the association is stored in the high order
portion and the key in the low order portion of the primary table words; (3) relative
association storage, in which keys and associations are stored in different memory
tables at addresses which are the same relative distance from the beginning of the
tables, taking the relative widths of the keys and associations into account (this
storage reduces the number of reads to find a key and is especially useful for two
bank configurations when the tables are in different banks); and (4) extra wide
association storage, in which the associations are stored in individually allocated
portions of the CAM memory with pointers from the key words.
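For relative association storage, the association address follows from the key address by scaling the record index by the two record widths. The sketch below is an assumption-laden illustration: the function name and all parameters are hypothetical, and addresses and widths are expressed in 64-bit words.

```cpp
#include <cassert>
#include <cstdint>

// Address of the association paired with the key record at key_addr under
// relative association storage. All names are hypothetical; addresses and
// widths are in 64-bit words.
std::uint64_t relative_association_address(std::uint64_t key_table_base,
                                           std::uint64_t key_addr,
                                           unsigned key_words,
                                           std::uint64_t assoc_table_base,
                                           unsigned assoc_words) {
    assert(key_words > 0);
    assert(key_addr >= key_table_base);
    assert((key_addr - key_table_base) % key_words == 0);
    // Same record index in both tables, scaled by each table's record width.
    const std::uint64_t record_index = (key_addr - key_table_base) / key_words;
    return assoc_table_base + record_index * assoc_words;
}
```

For instance, with two-word keys starting at word 1000 and four-word associations starting at word 5000, the key at word 1006 is record 3 and its association begins at word 5012.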

The CAM engine 100 of the present invention responds to a "delete record"
command from the host signal processor 104 by deleting the record stored in
RAM 108 and referenced by the key presented by the host in the table that
currently has context. The CAM engine invokes the delete function 160 of FIG. 1
by first performing a seek until the check for match process finds a match, or the
check for no match process determines there is no match. If the latter condition
occurs, an error is reported and the delete terminates. If a match is found, an
upstream burst read/write is initiated to shuffle all appropriate keys up the width
of the record to fill the hole created by the delete. Relative associations are
handled in exactly the same manner. For linked associations, the association
record is returned to the association free space.

Similar to the seek and add functions discussed above, the delete function
utilizes the burst read capability of SDRAM or pipelined SRAM and the general
parallel process arrangement. The parallel delete process continues until either the
buffer is full or the end of the hash heap is found. If the buffer becomes full before
the end of the hash heap is found, then context is recorded, the burst read is
terminated, a burst write is initiated to write the buffer, context is reestablished,
and the process begins again.
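The seek-then-shuffle behavior of the delete function can be sketched as follows. This is an illustration under stated assumptions: `Slot`, `hash_key` and `delete_key` are hypothetical names, a simple modulus stands in for the configured hash, keys are single 64-bit words, and the buffering and burst transfers described above are not modeled.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

struct Slot {
    bool used;          // false marks a free word
    std::uint64_t key;
};

// Stand-in hash for the sketch; the engine would use its configured hash.
std::size_t hash_key(std::uint64_t key, std::size_t table_size) {
    return static_cast<std::size_t>(key % table_size);
}

// Deletes key and shuffles following displaced keys up by one word.
// Returns false (the error case) when the key is not in the table.
bool delete_key(std::vector<Slot>& table, std::uint64_t key) {
    assert(!table.empty());
    const std::size_t size = table.size();
    const std::size_t target = hash_key(key, size);

    // Seek phase: stop on a match, a free word, or a greater hash code.
    std::size_t pos = target;
    while (pos < size && table[pos].used && table[pos].key != key) {
        if (hash_key(table[pos].key, size) > target) return false;
        ++pos;
    }
    if (pos == size || !table[pos].used) return false;

    // Shuffle phase: a following key moves up only if it sits below its own
    // hash address, so no key ever ends up above the word it hashes to.
    std::size_t next = pos + 1;
    while (next < size && table[next].used &&
           hash_key(table[next].key, size) < next) {
        table[next - 1] = table[next];
        ++next;
    }
    table[next - 1] = Slot{false, 0};  // the hole ends up here
    return true;
}
```

Stopping the shuffle at the first key already stored at its own hash address keeps the hash sequence intact for every later seek.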

Besides the twelve exemplary write commands, described above, sent by the
host signal processor 104 to the CAM engine 100, the CAM engine also responds
to two read commands from the host. In response to a "read status" command,
the CAM engine returns the current status of the output FIFO 136 to the host 104
on the data bus 116 and indicates if the command was successful and if there are
data available to be read from the output FIFO. In response to a "read data"
command, the CAM engine returns an output bus portion of the output FIFO.

The CAM engine 100 of the present invention has utility in vastly improving
the performance of table-driven software applications that implement data storing,
correlating and/or sorting operations. The CAM engine is a hardware solution
that combines the density and cost benefits of traditional RAM devices with the
speed and intuitiveness of conventional CAM devices. The CAM engine also
off-loads a significant amount of the time the host signal processor normally
spends in performing table lookups. The CAM engine operates with dedicated
memory, and its operation does not conflict with other memory cycles performed
by the host signal processor. This reduces memory bottlenecks by eliminating the
contention for the common memory bus.

The CAM engine finds applications in such areas as artificial intelligence,
pattern recognition, image processing, robotics control, radar imaging,
disk/database caching, communications networking, and generally in applications
that perform arithmetic operations on large sets of data. The CAM engine is
suitable for any application that requires very fast manipulation of large tables of
data.

The CAM engine eliminates the need for specific software implementations
to effectively transform RAM into CAM, and also eliminates the need for a
custom ASIC for hardware implementations. The CAM engine allows for
software centric systems that achieve performance levels previously only available
with custom hardware applications. It is difficult to accurately quantify the speed
advantages of the CAM engine as compared to a software-based RAM search
algorithm, because of the large number of variables involved. However, it can be
conservatively stated that the CAM engine of the present invention, with
association matches of 100 nanoseconds or less, can outperform a software
lookup by at least an order of magnitude.

By driving a large bank of RAM (e.g., up to 256 gigabits), the CAM engine
provides for direct memory access via keys rather than physical addresses, as in
the conventional usage of RAM. Thus, the CAM engine provides for significant
cost and scale advantages over conventional CAM memory devices, and provides
dramatically superior performance when compared with existing database
technology.

Software implementations of proximity matching or distance algorithms
normally require numerous processor cycles and memory accesses for each key
examined. In contrast, the CAM engine's proximity match feature allows the
CAM engine, operating at 100 MHz, to examine up to 50 million keys per
second. This enables the CAM engine to significantly outperform a software
distance algorithm. In doing so, the CAM engine also offloads a significant
amount of processor tasks. For example, in a voice recognition application,
instead of searching a sound signature database for the closest matching sound,
the host processor can be processing the next sound wavelet while the CAM
engine performs a proximity match on a previously computed signature.

The CAM engine of the present invention is also well-suited for high speed
router applications where current CAM technology has been virtually abandoned
because of price and size issues. In these communication applications, the CAM
engine eliminates the need for specialized software or custom hardware solutions.
The flexibility and speed of the CAM engine, together with the fact that it can be
software driven, allows system designers the flexibility of software centric
architectures with the speed of hardware solutions. Also, being allowed to choose
between interfaced SRAM or SDRAM devices gives designers the ability to make
speed/cost tradeoffs and to reuse portions of the design in environments with
differing performance constraints.

Artificial intelligence learning systems can benefit greatly from the proximity
match feature implemented by the CAM engine. Real time artificial intelligence
and control systems, from decision support to robotics control systems, are rule
based state machines. The CAM engine's ability to store multiple rule tables
effectively provides such applications with a real time database. Many of the same
benefits that general purpose databases provide for typical applications can be
experienced by real time applications using the CAM engine of the present
invention.

The benefits derived from traditional database caching to RAM can be
greatly enhanced by caching to RAM-implemented CAM instead. Through use of
flexibly-configured CAM tables implemented in large amounts of interfaced RAM,
a database application can store the most recently used database keys and pointers
in CAM tables and significantly improve performance when cached data are
reused. When used in conjunction with traditional caching techniques, the
architecture could be similar to level one and level two caching used by
processors.

It should be understood by those skilled in the art that obvious structural
modifications can be made without departing from the scope of the invention.
Accordingly, reference should be made primarily to the accompanying claims,
rather than the foregoing specification, to determine the scope of the invention.
Having thus described the invention, what is claimed is:

Claims

1. A memory controller, comprising:

receiving means for receiving a plurality of command signals
indicative of a corresponding plurality of data storage, retrieval and manipulation
operations performed by the memory controller in conjunction with a plurality of
normally location-addressed randomly-accessible memory data storage cells; and
signal processing means, responsive to the plurality of command
signals, for partitioning the plurality of memory data storage cells into at least one
data structure, for storing data in certain ones of the plurality of storage cells of
the at least one data structure, the data stored including a key and an association
wherein the key has a corresponding relationship to the association and wherein
the normally location-addressed randomly-accessible memory data storage cells
are operable instead as associative memory when the data are selectively stored in
the plurality of storage cells, and for retrieving data stored in the certain ones of
the plurality of storage cells utilizing a selected key as an index to the desired one
of the plurality of memory data storage cells from which the stored association
related to the selected key is retrieved such that the normally location-addressed
randomly-accessible memory data storage cells are operable instead as associative
memory when the data are selectively retrieved from the plurality of storage cells.

2. The memory controller of Claim 1, wherein the signal processing means further
comprises means for partitioning the plurality of memory data storage cells into a
plurality of data structures, each one of the plurality of data structures having a
predetermined record capacity, wherein the predetermined record capacities may
differ.

3. The memory controller of Claim 2, wherein the signal processing means further
comprises means for storing data in certain ones of the plurality of memory data
storage cells of each one of the plurality of data structures, the data stored
including a plurality of keys and a plurality of associations, wherein each one of
the plurality of keys has a corresponding relationship to at least one of the
plurality of associations.

4. The memory controller of Claim 3, wherein each one of the plurality of keys
has a predetermined width comprising a number of data bits and wherein the
predetermined widths of the keys may differ.

5. The memory controller of Claim 3, wherein each one of the plurality of
associations has a predetermined width comprising a number of data bits and
wherein the predetermined widths of the associations may differ.

6. The memory controller of Claim 2, wherein the signal processing means further
comprises means for arranging at least two of the plurality of data structures into a
hierarchical relationship wherein a first one of the data structures is designated as
a parent table and wherein at least a subsequent one of the data structures is
designated as a child table.

7. The memory controller of Claim 6, wherein the signal processing means further
comprises means for retrieving data stored in the parent table and in the child table
by searching the parent table for the desired stored key and if that key is not found
in the parent table then searching the child table for the desired stored key,
wherein if the width of the key searched for in the parent table is less than the
width of the key searched for in the child table then the key searched for in child
table is masked to the number of data bits comprising the key stored in the child
table.

8. The memory controller of Claim 2, wherein the signal processing means further
comprises means for creating an overflow data structure as one of the plurality of
data structures when the signal processing means exceeds the predetermined
record capacity of a primary one of the plurality of data structures as the signal
processing means is storing data in the primary one of the plurality of data
structures.

9. The memory controller of Claim 1, wherein one of the command signals
comprises a seek exact match command wherein the signal processing means
further comprises means for seeking an exact match between a key presented to
the signal processing means and a key stored in certain ones of the plurality of
memory data storage cells.

10. The memory controller of Claim 1, wherein one of the command signals
comprises a seek proximity match command wherein the signal processing means
further comprises means for seeking an approximate match between a key
presented to the signal processing means and a key stored in certain ones of the
plurality of memory data storage cells.

11. The memory controller of Claim 10, wherein the signal processing means
further comprises means for seeking an approximate match between a key
presented to the signal processing means and a key stored in certain ones of the
plurality of memory data storage cells according to the formula
$\sum_{n} \lvert E_n - T_n \rvert$.

12. The memory controller of Claim 10, wherein the signal processing means
further comprises means for seeking an approximate match between a key
presented to the signal processing means and a key stored in certain ones of the
plurality of memory data storage cells according to the formula
$\bigl( \sum_{n} (E_n - T_n)^2 \bigr)^{1/2}$.

13. The memory controller of Claim 1, wherein the signal processing means
further comprises means for storing the key and the association in certain ones of
the plurality of memory data storage cells of the at least one data structure and for
storing a linking pointer in certain ones of the plurality of memory data storage
cells of the at least one data structure, wherein the linking pointer is indicative of
the certain ones of the plurality of memory data storage cells of the at least one
data structure where the association is stored.

14. The memory controller of Claim 1, wherein the signal processing means
further comprises means for storing the key in a first bank comprising certain ones
of the plurality of memory data storage cells of the at least one data structure and
for storing the association in a second bank comprising certain ones of the
plurality of memory data storage cells of the at least one data structure.

15. The memory controller of Claim 1, wherein a signal processor is operable to
provide the plurality of command signals to the receiving means.

16. The memory controller of Claim 15, wherein the signal processor is on the
same integrated circuit as the receiving means and the signal processing means.

17. The memory controller of Claim 1, wherein the plurality of memory data
storage cells are on the same integrated circuit as the receiving means and the
signal processing means.

18. The memory controller of Claim 1, wherein the signal processing means
further comprises means for unloading a portion of the data stored in certain ones
of the plurality of memory data storage cells of the at least one data structure to
a selected device.

19. The memory controller of Claim 18, wherein the signal processing means
further comprises means for loading the portion of the data unloaded to the
selected device into certain ones of the plurality of memory data storage cells of
the at least one data structure.

20. The memory controller of Claim 1, wherein the signal processing means
comprises means for storing data in certain ones of the plurality of memory data
storage cells of the at least one data structure utilizing a hashing function based on
mathematical operations.



[FIG. 3 (drawing sheet 3/10 of WO 99/26139, PCT/US98/24288): table with columns
Key Width, Association Width, RAM Storage and Notes. The rows cover keys of up to
32 bits (2 to 4 bytes), up to 96 bits (5 to 12 bytes), up to 224 bits (13 to 28 bytes)
and up to 480 bits (29 to 60 bytes). Each RAM storage layout shows a pointer and
the key words, followed by Association Word 0, Association Word 1, and so on; an
association of 2^AssocWidth bytes occupies 2^(AssocWidth-3) 64-bit words. Note for
the 480-bit row: this key width may not be provided in all embodiments of this
invention.]



Drawing sheets 4/10 to 10/10 (WO 99/26139, PCT/US98/24288): the figure content of these sheets did not survive text extraction.

INTERNATIONAL SEARCH REPORT

International application No.
PCT/US98/24288

A. CLASSIFICATION OF SUBJECT MATTER

IPC(6): G06F 12/00, 13/00; G11C 15/00
US CL: 711/108, 126, 129, 170, 173
According to International Patent Classification (IPC) or to both national classification and IPC



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols)

U.S.: 711/108, 126, 129, 170, 173

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched

Electronic data base consulted during the international search (name of data base and, where practicable, search terms used)
APS, IEEE Publications



C. DOCUMENTS CONSIDERED TO BE RELEVANT

Category*   Citation of document, with indication, where appropriate, of the relevant passages   Relevant to claim No.

Y, P        US 5,806,083 ... and cols. 1-11                                        1-20
A, P        US 5,796,974 ...                                                       1
A, P        US 5,787,458 ...                                                       1
Y, P        US 5,761,714 ...                                                       1-20
Y, P        US 5,752,260 ... and 3.                                                1-20
Y, P        US 5,706,224 A (SRINIVASAN et al.) 06 January 1998, cols. 1-6          1-20



[X] Further documents are listed in the continuation of Box C.      See patent family annex.



* Special categories of cited documents:

"A" document defining the general state of the art which is not considered to be of particular relevance

"E" earlier document published on or after the international filing date

"L" document which may throw doubts on priority claim(s) or which is cited to establish the publication date of another citation or other special reason (as specified)

"O" document referring to an oral disclosure, use, exhibition or other means

"P" document published prior to the international filing date but later than the priority date claimed

"T" later document published after the international filing date or priority date and not in conflict with the application but cited to understand the principle or theory underlying the invention

"X" document of particular relevance; the claimed invention cannot be considered novel or cannot be considered to involve an inventive step when the document is taken alone

"Y" document of particular relevance; the claimed invention cannot be considered to involve an inventive step when the document is combined with one or more other such documents, such combination being obvious to a person skilled in the art

"&" document member of the same patent family



Date of the actual completion of the international search 
14 JANUARY 1999 



Date of mailing of the international search report



Name and mailing address of the ISA/ US 
Commissioner of Patents and Trademarks 

Box PCT 

Washington, D.C. 20231
Facsimile No. (703) 305-3230 



Authorized officer 

JOHN CABECA 
Telephone No. (703) 308-31 f6 







C (Continuation). DOCUMENTS CONSIDERED TO BE RELEVANT

Category*   Citation of document, with indication, where appropriate, of the relevant passages   Relevant to claim No.

Y           US 5,423,015 A (CHUNG) 06 June 1995, abstract and cols. 1-21.          1-20
Y           US 5,383,146 A (THREEWITT) 17 January 1995, abstract and cols. 1-7.    1-20

Form PCT/ISA/210 (continuation of second sheet) (July 1992)



